Skip to main content

Showing 1–50 of 680 results for author: Baee, S

.
  1. arXiv:2506.05850  [pdf, ps, other

    cs.CL cs.AI

    Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

    Authors: Cheonbok Park, Jeonghoon Kim, Joosung Lee, Sanghwan Bae, Jaegul Choo, Kang Min Yoo

    Abstract: We identify \textbf{Cross-lingual Collapse}, a systematic drift in which the chain-of-thought (CoT) of a multilingual language model reverts to its dominant pre-training language even when the prompt is expressed in a different language. Recent large language models (LLMs) with reinforcement learning with verifiable reward (RLVR) have achieved strong logical reasoning performances by exposing thei… ▽ More

    Submitted 9 June, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

    Comments: Preprint

  2. arXiv:2506.03960  [pdf, ps, other

    cs.CG math.MG

    Better Late than Never: the Complexity of Arrangements of Polyhedra

    Authors: Boris Aronov, Sang Won Bae, Sergio Cabello, Otfried Cheong, David Eppstein, Christian Knauer, Raimund Seidel

    Abstract: Let $\mathcal{A}$ be the subdivision of $\mathbb{R}^d$ induced by $m$ convex polyhedra having $n$ facets in total. We prove that $\mathcal{A}$ has combinatorial complexity $O(m^{\lceil d/2 \rceil} n^{\lfloor d/2 \rfloor})$ and that this bound is tight. The bound is mentioned several times in the literature, but no proof for arbitrary dimension has been published before.

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: An earlier version appeared in EuroCG 2025

  3. arXiv:2505.22568  [pdf

    eess.IV cs.CV

    Multipath cycleGAN for harmonization of paired and unpaired low-dose lung computed tomography reconstruction kernels

    Authors: Aravind R. Krishnan, Thomas Z. Li, Lucas W. Remedios, Michael E. Kim, Chenyu Gao, Gaurav Rudravaram, Elyssa M. McMaster, Adam M. Saunders, Shunxing Bao, Kaiwen Xu, Lianrui Zuo, Kim L. Sandler, Fabien Maldonado, Yuankai Huo, Bennett A. Landman

    Abstract: Reconstruction kernels in computed tomography (CT) affect spatial resolution and noise characteristics, introducing systematic variability in quantitative imaging measurements such as emphysema quantification. Choosing an appropriate kernel is therefore essential for consistent quantitative analysis. We propose a multipath cycleGAN model for CT kernel harmonization, trained on a mixture of paired… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  4. arXiv:2505.19603  [pdf, ps, other

    cs.CV cs.LG

    Rep3D: Re-parameterize Large 3D Kernels with Low-Rank Receptive Modeling for Medical Imaging

    Authors: Ho Hin Lee, Quan Liu, Shunxing Bao, Yuankai Huo, Bennett A. Landman

    Abstract: In contrast to vision transformers, which model long-range dependencies through global self-attention, large kernel convolutions provide a more efficient and scalable alternative, particularly in high-resolution 3D volumetric settings. However, naively increasing kernel size often leads to optimization instability and degradation in performance. Motivated by the spatial bias observed in effective… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 14 pages

  5. arXiv:2505.19451  [pdf, ps, other

    math.AG math.CV

    Algebraic Zhou valuations

    Authors: Shijie Bao, Qi'an Guan, Lin Zhou

    Abstract: In this paper, we generalize Zhou valuations, originally defined on complex domains, to the framework of general schemes. We demonstrate that an algebraic version of the Jonsson--Mustaţă conjecture is equivalent to the statement that every Zhou valuation is quasi-monomial. By introducing a mixed version of jumping numbers and Tian functions associated with valuations, we obtain characterizations o… ▽ More

    Submitted 5 June, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

    Comments: 43 pages. All comments are welcome! We fix some typos in the 2rd version

    MSC Class: 14F18; 12J20; 14B05; 32U05; 32U35

  6. arXiv:2505.17818  [pdf, other

    cs.AI cs.CL

    PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions

    Authors: Daeun Kyung, Hyunseung Chung, Seongsu Bae, Jiho Kim, Jae Ho Sohn, Taerim Kim, Soo Kyung Kim, Edward Choi

    Abstract: Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often fail to reflect the full range of personas seen in clinical practice. To address this, we introduce PatientSim, a patient simulator that generates rea… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 9 pages for main text, 4 pages for references, 27 pages for supplementary materials

  7. arXiv:2505.11131  [pdf, ps, other

    cs.CV cs.AI

    One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

    Authors: Feiran Li, Qianqian Xu, Shilong Bao, Zhiyong Yang, Xiaochun Cao, Qingming Huang

    Abstract: Concept erasing has recently emerged as an effective paradigm to prevent text-to-image diffusion models from generating visually undesirable or even harmful content. However, current removal methods heavily rely on manually crafted text prompts, making it challenging to achieve a high erasure (efficacy) while minimizing the impact on other benign concepts (usability). In this paper, we attribute t… ▽ More

    Submitted 26 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

    Comments: This paper has been accepeted to ICML 2025

  8. arXiv:2505.10023  [pdf, ps, other

    hep-ph hep-ex

    Light Axion-Like Particles at Future Lepton Colliders

    Authors: Shou-shan Bao, Yang Ma, Yongcheng Wu, Keping Xie, Hong Zhang

    Abstract: Axion-like particles (ALPs) are well-motivated extensions of the Standard Model (SM) that appear in many new physics scenarios, with masses spanning a broad range. In this work, we systematically study the production and detection prospects of light ALPs at future lepton colliders, including electron-positron and multi-TeV muon colliders. At lepton colliders, light ALPs can be produced in associat… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 46 pages, 20 figures

    Report number: COMETA-2025-02, IRMP-CP3-25-09, MSUHEP-25-002, CPTNP-2025-011

  9. arXiv:2505.09091  [pdf, ps, other

    cs.SD cs.AI cs.CV cs.LG eess.AS

    DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis

    Authors: Zeeshan Ahmad, Shudi Bao, Meng Chen

    Abstract: In recent years, generative adversarial networks (GANs) have made significant progress in generating audio sequences. However, these models typically rely on bandwidth-limited mel-spectrograms, which constrain the resolution of generated audio sequences, and lead to mode collapse during conditional generation. To address this issue, we propose Deformable Periodic Network based GAN (DPN-GAN), a nov… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Journal ref: IEEE Access, vol. 13, pp. 69324-69340, 2025

  10. arXiv:2505.08809  [pdf, ps, other

    cs.CR cs.AI

    MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges

    Authors: Shixi Qin, Zhiyong Yang, Shilong Bao, Shi Wang, Qianqian Xu, Qingming Huang

    Abstract: This paper focuses on implanting multiple heterogeneous backdoor triggers in bridge-based diffusion models designed for complex and arbitrary input distributions. Existing backdoor formulations mainly address single-attack scenarios and are limited to Gaussian noise input models. To fill this gap, we propose MixBridge, a novel diffusion Schrödinger bridge (DSB) framework to cater to arbitrary inpu… ▽ More

    Submitted 26 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

  11. arXiv:2505.06698  [pdf, other

    cs.CL

    From Rankings to Insights: Evaluation Should Shift Focus from Leaderboard to Feedback

    Authors: Zongqi Wang, Tianle Gu, Chen Gong, Xin Tian, Siqi Bao, Yujiu Yang

    Abstract: Automatic evaluation benchmarks such as MT-Bench, Arena-Hard, and Auto-Arena are seeing growing adoption for the evaluation of Large Language Models (LLMs). Existing research has primarily focused on approximating human-based model rankings using limited data and LLM-as-a-Judge. However, the fundamental premise of these studies, which attempts to replicate human rankings, is flawed. Specifically,… ▽ More

    Submitted 16 May, 2025; v1 submitted 10 May, 2025; originally announced May 2025.

  12. arXiv:2505.05180  [pdf, other

    cs.LG

    OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning

    Authors: Cong Hua, Qianqian Xu, Zhiyong Yang, Zitai Wang, Shilong Bao, Qingming Huang

    Abstract: Prompt tuning adapts Vision-Language Models like CLIP to open-world tasks with minimal training costs. In this direction, one typical paradigm evaluates model performance separately on known classes (i.e., base domain) and unseen classes (i.e., new domain). However, real-world scenarios require models to handle inputs without prior domain knowledge. This practical challenge has spurred the develop… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted by ICML2025

  13. arXiv:2505.05095  [pdf, other

    hep-ex

    Axion Dark Matter Search with Near-KSVZ Sensitivity Using the TM$_{020}$ Mode

    Authors: Sungjae Bae, Junu Jeong, Younggeun Kim, SungWoo Youn, Jinsu Kim, Arjan F. van Loo, Yasunobu Nakamura, Seonjeong Oh, Taehyeon Seong, Sergey Uchaikin, Jihn E. Kim, Yannis K. Semertzidis

    Abstract: Dark matter remains one of the most profound mysteries in modern physics, with axions, a hypothetical particle proposed to resolve the strong CP problem, standing as a compelling candidate. Among various experimental strategies, cavity haloscopes currently offer the most sensitive method to detect axions, though their searches have largely been confined to axion masses below 10 $μ$eV. However, rec… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 7 pages, 4 figures

  14. arXiv:2505.04847  [pdf, other

    cs.CL cs.AI

    Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards

    Authors: Manveer Singh Tamber, Forrest Sheng Bao, Chenyu Xu, Ge Luo, Suleman Kazi, Minseok Bae, Miaoran Li, Ofer Mendelevitch, Renyi Qu, Jimmy Lin

    Abstract: Hallucinations remain a persistent challenge for LLMs. RAG aims to reduce hallucinations by grounding responses in contexts. However, even when provided context, LLMs still frequently introduce unsupported information or contradictions. This paper presents our efforts to measure LLM hallucinations with a focus on summarization tasks, assessing how often various LLMs introduce hallucinations when s… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  15. arXiv:2505.03695  [pdf, other

    cs.RO eess.SY

    Frenet Corridor Planner: An Optimal Local Path Planning Framework for Autonomous Driving

    Authors: Faizan M. Tariq, Zheng-Hang Yeh, Avinash Singh, David Isele, Sangjae Bae

    Abstract: Motivated by the requirements for effectiveness and efficiency, path-speed decomposition-based trajectory planning methods have widely been adopted for autonomous driving applications. While a global route can be pre-computed offline, real-time generation of adaptive local paths remains crucial. Therefore, we present the Frenet Corridor Planner (FCP), an optimization-based local path planning stra… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 8 pages, 10 figures - Presented at 2025 IEEE 36th Intelligent Vehicles Symposium (IV)

  16. arXiv:2505.03060  [pdf

    cond-mat.mtrl-sci

    Atom-by-atom Imaging of Moiré Phasons using Electron Ptychography

    Authors: Yichao Zhang, Ballal Ahammed, Sang Hyun Bae, Chia-Hao Lee, Jeffrey Huang, Mohammad Abir Hossain, Tawfiqur Rakib, Arend van der Zande, Elif Ertekin, Pinshane Y. Huang

    Abstract: Twisted 2D materials exhibit unique vibrational modes called moiré phonons, which arise from the moiré superlattice. Here, we demonstrate atom-by-atom imaging of phasons, an ultrasoft class of moiré phonons in twisted bilayer WSe2. Using ultrahigh-resolution (<15 pm) electron ptychography, we image the size and shape of each atom to extract time-averaged vibrational amplitudes as a function of twi… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  17. arXiv:2505.02830  [pdf, ps, other

    cs.CV cs.CL

    AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation

    Authors: Qingqiu Li, Zihang Cui, Seongsu Bae, Jilan Xu, Runtian Yuan, Yuejie Zhang, Rui Feng, Quanli Shen, Xiaobo Zhang, Junjun He, Shujun Wang

    Abstract: Chest X-rays (CXRs) are the most frequently performed imaging examinations in clinical settings. Recent advancements in Large Multimodal Models (LMMs) have enabled automated CXR interpretation, enhancing diagnostic accuracy and efficiency. However, despite their strong visual understanding, current Medical LMMs (MLMMs) still face two major challenges: (1) Insufficient region-level understanding an… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  18. arXiv:2505.02069  [pdf, other

    cs.LG stat.ML

    Neural Logistic Bandits

    Authors: Seoungbin Bae, Dabeen Lee

    Abstract: We study the problem of neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. Existing approaches either exhibit unfavorable dependencies on $κ$, where $1/κ$ represents the minimum variance of reward distributions, or suffer from direct dependence on the feature dimension $d$, which can be huge in neural network-… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  19. arXiv:2504.18935  [pdf, other

    astro-ph.CO astro-ph.HE gr-qc hep-ph hep-th

    Superradiant dark matter production from primordial black holes: Impact of multiple modes and gravitational wave emission

    Authors: Nayun Jia, Shou-Shan Bao, Chen Zhang, Hong Zhang, Xin Zhang

    Abstract: Rotating primordial black holes (PBHs) in the early universe can emit particles through superradiance, a process particularly efficient when the particle's Compton wavelength is comparable to the PBH's gravitational radius. Superradiance leads to an exponential growth of particle occupation numbers in gravitationally bound states. We present an analysis of heavy bosonic dark matter (DM) production… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 29 pages, 3 figures

  20. arXiv:2504.18539  [pdf, other

    eess.AS cs.LG cs.MM cs.SD

    Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation

    Authors: Sungnyun Kim, Sungwoo Cho, Sangmin Bae, Kangwook Jang, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) incorporates auditory and visual modalities to improve recognition accuracy, particularly in noisy environments where audio-only speech systems are insufficient. While previous research has largely addressed audio disruptions, few studies have dealt with visual corruptions, e.g., lip occlusions or blurred videos, which are also detrimental. To address this re… ▽ More

    Submitted 30 April, 2025; v1 submitted 23 January, 2025; originally announced April 2025.

    Comments: ICLR 2025; 22 pages, 6 figures, 14 tables

  21. arXiv:2504.17364  [pdf, ps, other

    cs.CV

    I-INR: Iterative Implicit Neural Representations

    Authors: Ali Haider, Muhammad Salman Ali, Maryam Qamar, Tahir Khalil, Soo Ye Kim, Jihyong Oh, Enzo Tartaglione, Sung-Ho Bae

    Abstract: Implicit Neural Representations (INRs) have revolutionized signal processing and computer vision by modeling signals as continuous, differentiable functions parameterized by neural networks. However, their inherent formulation as a regression problem makes them prone to regression to the mean, limiting their ability to capture fine details, retain high-frequency information, and handle noise effec… ▽ More

    Submitted 9 June, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  22. arXiv:2504.16904  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Observation of Double Hysteresis in CoFe$_2$O$_4$/MnFe$_2$O$_4$ Core/Shell Nanoparticles and Its Contribution to AC Heat Induction

    Authors: Jie Wang, Hyungsub Kim, Ji-wook Kim, HyeongJoo Seo, Satoshi Ota, Chun-Yeol You, Yasushi Takemura, Seongtae Bae

    Abstract: Magnetic core/shell nanoparticles are promising candidates for magnetic hyperthermia due to its high AC magnetic heat induction (specific loss power (SLP)). It's widely accepted that magnetic exchange-coupling between core and shell plays the crucial role in enhancing SLP of magnetic core/shell nanoparticles. However, the physical contribution of exchange coupling to SLP has not been systematicall… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  23. arXiv:2504.13280  [pdf

    quant-ph cond-mat.mes-hall cond-mat.mtrl-sci

    Atomic-scale imaging and charge state manipulation of NV centers by scanning tunneling microscopy

    Authors: Arjun Raghavan, Seokjin Bae, Nazar Delegan, F. Joseph Heremans, Vidya Madhavan

    Abstract: Nitrogen-vacancy (NV) centers in diamond are among the most promising solid-state qubit candidates, owing to their exceptionally long spin coherence times, efficient spin-photon coupling, room-temperature operation, and steadily advancing fabrication and integration techniques. Despite significant progress in the field, atomic-scale characterization and control of individual NV centers have remain… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 27 pages; 4 main figures and 10 supplementary figures

  24. arXiv:2504.12616  [pdf, other

    cs.RO eess.SY

    Graph-based Path Planning with Dynamic Obstacle Avoidance for Autonomous Parking

    Authors: Farhad Nawaz, Minjun Sung, Darshan Gadginmath, Jovin D'sa, Sangjae Bae, David Isele, Nadia Figueroa, Nikolai Matni, Faizan M. Tariq

    Abstract: Safe and efficient path planning in parking scenarios presents a significant challenge due to the presence of cluttered environments filled with static and dynamic obstacles. To address this, we propose a novel and computationally efficient planning strategy that seamlessly integrates the predictions of dynamic obstacles into the planning process, ensuring the generation of collision-free paths. O… ▽ More

    Submitted 7 May, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: IEEE Intelligent Vehicles Symposium 2025

  25. arXiv:2504.12185  [pdf, other

    cs.CL cs.AI

    SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data

    Authors: Suyoung Bae, Hyojun Kim, YunSeok Choi, Jee-Hyong Lee

    Abstract: In various natural language processing (NLP) tasks, fine-tuning Pre-trained Language Models (PLMs) often leads to the issue of spurious correlations, which negatively impacts performance, particularly when dealing with out-of-distribution data. To address this problem, we propose SALAD}(Structure Aware and LLM-driven Augmented Data), a novel approach designed to enhance model robustness and genera… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: Accepted to NAACL 2025 main. 15 pages, 4 figures

  26. arXiv:2504.06960  [pdf, other

    cs.CG

    Higher-Order Color Voronoi Diagrams and the Colorful Clarkson-Shor Framework

    Authors: Sang Won Bae, Nicolau Oliver, Evanthia Papadopoulou

    Abstract: Given a set $S$ of $n$ colored sites, each $s\in S$ associated with a distance-to-site function $δ_s \colon \mathbb{R}^2 \to \mathbb{R}$, we consider two distance-to-color functions for each color: one takes the minimum of $δ_s$ for sites $s\in S$ in that color and the other takes the maximum. These two sets of distance functions induce two families of higher-order Voronoi diagrams for colors in t… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: 43 pages, 11 figures

  27. arXiv:2504.06008  [pdf, ps, other

    nucl-ex

    Impact of newly measured $β$\nobreakdash-delayed neutron emitters around \myisoSimp{78}{Ni} on light element nucleosynthesis in the neutrino-wind following a neutron star merger

    Authors: A. Tolosa-Delgado, J. L. Tain, M. Reichert, A. Arcones, M. Eichler, B. C. Rasco, N. T. Brewer, K. P. Rykaczewski, R. Yokoyama, R. Grzywacz, I. Dillmann, J. Agramunt, D. S. Ahn, A. Algora, H. Baba, S. Bae, C. G. Bruno, R. Caballero Folch, F. Calvino, P. J. Coleman-Smith, G. Cortes, T. Davinson, C. Domingo-Pardo, A. Estrade, N. Fukuda , et al. (49 additional authors not shown)

    Abstract: Neutron emission probabilities and half-lives of 37 beta-delayed neutron emitters from 75Ni to 92Br were measured at the RIKEN Nishina Center in Japan, including 11 one-neutron and 13 two-neutron emission probabilities and 6 half-lives measured for the first time, which supersede theoretical estimates. These nuclei lie in the path of the weak r-process occurring in neutrino-driven winds from the a… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  28. arXiv:2504.03380  [pdf, other

    cs.CL cs.AI

    Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

    Authors: Sanghwan Bae, Jiwoo Hong, Min Young Lee, Hanbyul Kim, JeongYeon Nam, Donghyun Kwak

    Abstract: Reasoning-Oriented Reinforcement Learning (RORL) enhances the reasoning ability of Large Language Models (LLMs). However, due to the sparsity of rewards in RORL, effective training is highly dependent on the selection of problems of appropriate difficulty. Although curriculum learning attempts to address this by adjusting difficulty, it often relies on static schedules, and even recent online filt… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  29. arXiv:2504.00698  [pdf

    cs.CL cs.AI cs.LG

    Command A: An Enterprise-Ready Large Language Model

    Authors: Team Cohere, :, Aakanksha, Arash Ahmadian, Marwan Ahmed, Jay Alammar, Milad Alizadeh, Yazeed Alnumay, Sophia Althammer, Arkady Arkhangorodsky, Viraat Aryabumi, Dennis Aumiller, Raphaël Avalos, Zahara Aviv, Sammie Bae, Saurabh Baji, Alexandre Barbet, Max Bartolo, Björn Bebensee, Neeral Beladia, Walter Beller-Morales, Alexandre Bérard, Andrew Berneshawi, Anna Bialas, Phil Blunsom , et al. (205 additional authors not shown)

    Abstract: In this report we describe the development of Command A, a powerful large language model purpose-built to excel at real-world enterprise use cases. Command A is an agent-optimised and multilingual-capable model, with support for 23 languages of global business, and a novel hybrid architecture balancing efficiency with top of the range performance. It offers best-in-class Retrieval Augmented Genera… ▽ More

    Submitted 14 April, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

    Comments: 55 pages

  30. arXiv:2503.24109  [pdf, ps, other

    math.CV

    Demailly's approximation of general weights

    Authors: Shijie Bao, Qi'an Guan

    Abstract: In this note, we demonstrate the convergence of the Demailly approximation of a general (weakly) upper semi-continuous weight.

    Submitted 2 April, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: 4 pages

    MSC Class: 32A25; 32U05

  31. arXiv:2503.23394  [pdf, other

    q-bio.NC cs.AI

    Spatiotemporal Learning of Brain Dynamics from fMRI Using Frequency-Specific Multi-Band Attention for Cognitive and Psychiatric Applications

    Authors: Sangyoon Bae, Junbeom Kwon, Shinjae Yoo, Jiook Cha

    Abstract: Understanding how the brain's complex nonlinear dynamics give rise to adaptive cognition and behavior is a central challenge in neuroscience. These dynamics exhibit scale-free and multifractal properties, influencing the reconfiguration of neural networks. However, conventional neuroimaging models are constrained by linear and stationary assumptions, limiting their ability to capture these process… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

  32. arXiv:2503.20823  [pdf, other

    cs.CR

    Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy

    Authors: Joonhyun Jeong, Seyun Bae, Yeonsung Jung, Jaeryong Hwang, Eunho Yang

    Abstract: Despite the remarkable versatility of Large Language Models (LLMs) and Multimodal LLMs (MLLMs) to generalize across both language and vision tasks, LLMs and MLLMs have shown vulnerability to jailbreaking, generating textual outputs that undermine safety, ethical, and bias standards when exposed to harmful or sensitive inputs. With the recent advancement of safety alignment via preference-tuning fr… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: Accepted at CVPR2025

  33. arXiv:2503.19426  [pdf, other

    cs.CL cs.AI cs.CY

    DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models

    Authors: Suyoung Bae, YunSeok Choi, Jee-Hyong Lee

    Abstract: While Large Language Models (LLMs) excel in zero-shot Question Answering (QA), they tend to expose biases in their internal knowledge when faced with socially sensitive questions, leading to a degradation in performance. Existing zero-shot methods are efficient but fail to consider context and prevent bias propagation in the answers. To address this, we propose DeCAP, a method for debiasing LLMs u… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: Accepted to NAACL 2025 main. 20 pages, 3 figures

  34. arXiv:2503.19378  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Rapid Vapor-Assisted Solution Process of Metal-Organic Chalcogenides for High-Performance Light-Emitting Diodes

    Authors: Sang-Hyun Chin, Daseul Lee, Donggyu Lee, Kwanghyun Chung, Eunjong Yoo, Tong-Il Kim, Su Hwan Lee, Sang Woo Bae, Young-Hoon Kim, Yeonjin Yi

    Abstract: Metal-organic chalcogenides (MOCs), robust crystalline assemblies composed of coinage metals, chalcogens and organic ligands, are typically synthesized via prolonged, high temperature tarnishing of vacuum-deposited metal films with organochalcogen precursors. The prolonged exposure to high temperatures and the necessity for direct vacuum deposition of silver can induce damage to the underlying fil… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 23 pages with 3 figures and Supporting info

  35. arXiv:2503.12593  [pdf, other

    eess.IV cs.AI cs.LG physics.bio-ph q-bio.QM

    Fourier-Based 3D Multistage Transformer for Aberration Correction in Multicellular Specimens

    Authors: Thayer Alshaabi, Daniel E. Milkie, Gaoxiang Liu, Cyna Shirazinejad, Jason L. Hong, Kemal Achour, Frederik Görlitz, Ana Milunovic-Jevtic, Cat Simmons, Ibrahim S. Abuzahriyeh, Erin Hong, Samara Erin Williams, Nathanael Harrison, Evan Huang, Eun Seok Bae, Alison N. Killilea, David G. Drubin, Ian A. Swinburne, Srigokul Upadhyayula, Eric Betzig

    Abstract: High-resolution tissue imaging is often compromised by sample-induced optical aberrations that degrade resolution and contrast. While wavefront sensor-based adaptive optics (AO) can measure these aberrations, such hardware solutions are typically complex, expensive to implement, and slow when serially mapping spatially varying aberrations across large fields of view. Here, we introduce AOViFT (Ada… ▽ More

    Submitted 23 May, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

    Comments: 55 pages, 6 figures, 26 si figures, 8 si tables

  36. arXiv:2503.10941  [pdf, other

    cs.AI cs.LG cs.RO

    Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations

    Authors: Piyush Gupta, Sangjae Bae, David Isele

    Abstract: The adoption of Large Language Models (LLMs) is rapidly expanding across various tasks that involve inherent graphical structures. Graphs are integral to a wide range of applications, including motion planning for autonomous vehicles, social networks, scene understanding, and knowledge graphs. Many problems, even those not initially perceived as graph-based, can be effectively addressed through gr… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  37. arXiv:2503.10167  [pdf, other

    cs.CL

    "Well, Keep Thinking": Enhancing LLM Reasoning with Adaptive Injection Decoding

    Authors: Hyunbin Jin, Je Won Yeom, Seunghyun Bae, Taesup Kim

    Abstract: Large language models (LLMs) exhibit strong reasoning abilities, often attributed to few-shot or zero-shot chain-of-thought (CoT) prompting. While effective, these methods require labor-intensive prompt engineering, raising the question of whether reasoning can be induced without reliance on explicit prompts. In this work, we unlock the reasoning capabilities of LLMs without explicit prompting. In… ▽ More

    Submitted 17 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  38. arXiv:2503.08100  [pdf, other

    cs.HC

    Predicting Volleyball Season Performance Using Pre-Season Wearable Data and Machine Learning

    Authors: Melik Ozolcer, Tongze Zhang, Sang Won Bae

    Abstract: Predicting performance outcomes has the potential to transform training approaches, inform coaching strategies, and deepen our understanding of the factors that contribute to athletic success. Traditional non-automated data analysis in sports are often difficult to scale. To address this gap, this study analyzes factors influencing athletic performance by leveraging passively collected sensor data… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 11 pages, 4 figures, 8 tables

  39. arXiv:2503.06988  [pdf, ps, other

    quant-ph

    Formulas for Mutually Orthogonal Quantum States in Two-Qubit Systems: Orthogonal Schmidt Decompositions

    Authors: Yonghae Lee, Youngho Min, Sunghyun Bae, Youngrong Lim

    Abstract: We present Schmidt decomposition formulas for mutually orthogonal two-qubit pure states and classify orthonormal sets based on their entanglement structure. First, we derive explicit Schmidt decomposition formulas for any pure state and extend them to two orthogonal pure states. For three mutually orthogonal states, we provide formulas for specific cases and discuss the challenges of obtaining ana… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 14 pages

  40. arXiv:2503.06514  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks

    Authors: Haoqiang Kang, Enna Sachdeva, Piyush Gupta, Sangjae Bae, Kwonjoon Lee

    Abstract: Vision-Language Models (VLMs) have recently shown promising advancements in sequential decision-making tasks through task-specific fine-tuning. However, common fine-tuning methods, such as Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) techniques like Proximal Policy Optimization (PPO), present notable limitations: SFT assumes Independent and Identically Distributed (IID) data, while… ▽ More

    Submitted 25 March, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

    Journal ref: CVPR 2025

  41. arXiv:2503.06463  [pdf, other

    cs.HC

    AXAI-CDSS : An Affective Explainable AI-Driven Clinical Decision Support System for Cannabis Use

    Authors: Tongze Zhang, Tammy Chung, Anind Dey, Sang Won Bae

    Abstract: As cannabis use has increased in recent years, researchers have come to rely on sophisticated machine learning models to predict cannabis use behavior and its impact on health. However, many artificial intelligence (AI) models lack transparency and interpretability due to their opaque nature, limiting their trust and adoption in real-world medical applications, such as clinical decision support sy… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  42. arXiv:2503.02241  [pdf

    cs.CV cs.LG

    Unsupervised Waste Classification By Dual-Encoder Contrastive Learning and Multi-Clustering Voting (DECMCV)

    Authors: Kui Huang, Mengke Song, Shuo Ba, Ling An, Huajie Liang, Huanxi Deng, Yang Liu, Zhenyu Zhang, Chichun Zhou

    Abstract: Waste classification is crucial for improving processing efficiency and reducing environmental pollution. Supervised deep learning methods are commonly used for automated waste classification, but they rely heavily on large labeled datasets, which are costly and inefficient to obtain. Real-world waste data often exhibit category and style biases, such as variations in camera angles, lighting condi… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  43. arXiv:2502.20636  [pdf, ps, other

    cs.RO eess.SY

    Delayed-Decision Motion Planning in the Presence of Multiple Predictions

    Authors: David Isele, Alexandre Miranda Anon, Faizan M. Tariq, Goro Yeh, Avinash Singh, Sangjae Bae

    Abstract: Reliable automated driving technology is challenged by various sources of uncertainties, in particular, behavioral uncertainties of traffic agents. It is common for traffic agents to have intentions that are unknown to others, leaving an automated driving car to reason over multiple possible behaviors. This paper formalizes a behavior planning scheme in the presence of multiple possible futures wi… ▽ More

    Submitted 6 June, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  44. arXiv:2502.19457  [pdf, other

    cs.GR

    Compression in 3D Gaussian Splatting: A Survey of Methods, Trends, and Future Directions

    Authors: Muhammad Salman Ali, Chaoning Zhang, Marco Cagnazzo, Giuseppe Valenzise, Enzo Tartaglione, Sung-Ho Bae

    Abstract: 3D Gaussian Splatting (3DGS) has recently emerged as a pioneering approach in explicit scene rendering and computer graphics. Unlike traditional neural radiance field (NeRF) methods, which typically rely on implicit, coordinate-based models to map spatial coordinates to pixel values, 3DGS utilizes millions of learnable 3D Gaussians. Its differentiable rendering technique and inherent capability fo… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  45. arXiv:2502.10808  [pdf, other

    nucl-ex

    First measurement of 87Rb(α, xn) cross sections at weak r-process energies in supernova ν-driven ejecta to investigate elemental abundances in low-metallicity stars

    Authors: C. Fougères, M. L. Avila, A. Psaltis, M. Anastasiou, S. Bae, L. Balliet, K. Bhatt, L. Dienis, H. Jayatissa, V. Karayonchev, P. Mohr, F. Montes, D. Neto, F. de Oliveira Santos, W. -J. Ong, K. E. Rehm, W. Reviol, D. Santiago-Gonzalez, N. Sensharma, R. S. Sidhu, I. A. Tolstukhin

    Abstract: Observed abundances of Z ~ 40 elements in metal-poor stars vary from star to star, indicating that the rapid and slow neutron capture processes may not contribute alone to the synthesis of elements beyond iron. The weak r-process was proposed to produce Z ~ 40 elements in a subset of old stars. Thought to occur in the ν-driven ejecta of a core-collapse supernova, (α, xn) reactions would drive the… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

    Comments: 15 pages, 7 figures. Preprint version before peer review or editing, as submitted to Astrophysical Journal

  46. arXiv:2502.10447  [pdf, other

    eess.AS cs.CL cs.LG

    MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition

    Authors: Sungnyun Kim, Kangwook Jang, Sangmin Bae, Sungwoo Cho, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) has become critical for enhancing speech recognition in noisy environments by integrating both auditory and visual modalities. However, existing AVSR systems struggle to scale up without compromising computational efficiency. In this study, we introduce MoHAVE (Mixture of Hierarchical Audio-Visual Experts), a novel robust AVSR framework designed to address th… ▽ More

    Submitted 21 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: Accepted to ICML 2025

  47. arXiv:2502.08033  [pdf, other

    cs.RO cs.LG

    Predictive Planner for Autonomous Driving with Consistency Models

    Authors: Anjian Li, Sangjae Bae, David Isele, Ryne Beeson, Faizan M. Tariq

    Abstract: Trajectory prediction and planning are essential for autonomous vehicles to navigate safely and efficiently in dynamic environments. Traditional approaches often treat them separately, limiting the ability for interactive planning. While recent diffusion-based generative models have shown promise in multi-agent trajectory generation, their slow sampling is less suitable for high-frequency planning… ▽ More

    Submitted 2 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  48. arXiv:2502.05349  [pdf, ps, other

    math.OC cs.LG

    Contextual Scenario Generation for Two-Stage Stochastic Programming

    Authors: David Islip, Roy H. Kwon, Sanghyeon Bae, Woo Chang Kim

    Abstract: Two-stage stochastic programs (2SPs) are important tools for making decisions under uncertainty. Decision-makers use contextual information to generate a set of scenarios to represent the true conditional distribution. However, the number of scenarios required is a barrier to implementing 2SPs, motivating the problem of generating a small set of surrogate scenarios that yield high-quality decision… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 47 pages, 10 figures

  49. arXiv:2502.04207  [pdf, other

    cs.CV

    Enhanced Feature-based Image Stitching for Endoscopic Videos in Pediatric Eosinophilic Esophagitis

    Authors: Juming Xiong, Muyang Li, Ruining Deng, Tianyuan Yao, Shunxing Bao, Regina N Tyree, Girish Hiremath, Yuankai Huo

    Abstract: Video endoscopy represents a major advance in the investigation of gastrointestinal diseases. Reviewing endoscopy videos often involves frequent adjustments and reorientations to piece together a complete view, which can be both time-consuming and prone to errors. Image stitching techniques address this issue by providing a continuous and complete visualization of the examined area. However, endos… ▽ More

    Submitted 13 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

  50. arXiv:2502.03972  [pdf, other

    cond-mat.mtrl-sci

    Triple-Q state in magnetic breathing kagome lattice

    Authors: Hangyu Zhou, Manuel dos Santos Dias, Shijian Bao, Hanchen Lu, Youguang Zhang, Weisheng Zhao, Samir Lounis

    Abstract: Magnetic frustration in two-dimensional spin lattices with triangular motifs underpins a series of exotic states, ranging from multi-Q configurations to disordered spin-glasses. The antiferromagnetic kagome lattice, characterized by its network of corner-sharing triangles, represents a paradigmatic frustrated system exhibiting macroscopic degeneracy. Expanding upon the kagomerization mechanism, we… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 27 pages, 4 figures