-
Observation of the decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (364 additional authors not shown)
Abstract:
We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be…
▽ More
We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be $\mathcal{B}(B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}) = (5.74 \pm 1.11 \pm 0.42_{-1.53}^{+2.47}) \times 10^{-4}$ and $\mathcal{B}(B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}) = (4.83 \pm 1.12 \pm 0.37_{-0.60}^{+0.72}) \times 10^{-4}$. The first and second uncertainties are statistical and systematic, respectively, while the third ones arise from the absolute branching fractions of $\overlineΞ_{c}^{-}$ or $\overlineΞ_{c}^{0}$ decays. The data samples used for this analysis have integrated luminosities of 711~$\mathrm{fb}^{-1}$ and 365~$\mathrm{fb}^{-1}$, and were collected at the $Υ(4S)$ resonance by the Belle and Belle~II detectors operating at the KEKB and SuperKEKB asymmetric-energy $e^{+}e^{-}$ colliders, respectively.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Measurement of the $ D^{0}\rightarrow K^{-}π^{+}e^{+}e^{-} $ branching fraction and search for $ D^{0}\rightarrow π^{+}π^{-}e^{+}e^{-} $ and $D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $ decays at Belle
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae
, et al. (458 additional authors not shown)
Abstract:
We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fr…
▽ More
We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fractions to $ D^{0} \rightarrow K^{-}π^{+}π^{-}π^{+} $ decays. The branching fraction for decay $ D^{0} \rightarrow K^{-}π^{+}e^{+}e^{-} $ is measured to be (39.6 $\pm$ 4.5 (stat) $\pm$ 2.9 (syst)) $\times$ $10^{-7}$, with the dielectron mass in the $ ρ/ω$ mass region $ 675 < m_{ee} < 875 $ MeV$/c^{2}$. We also search for $ D^{0}\rightarrow h^{-} h^{(\prime)+}e^{+}e^{-} $ ($ h^{(\prime)}=K,\,π$) decays with the dielectron mass near the $η$ and $φ$ resonances, and away from these resonances for the $ K^{+}K^{-}e^{+}e^{-} $ and $ π^{+}π^{-}e^{+}e^{-} $ modes. For these modes, we find no significant signals and set 90$\%$ confidence level upper limits on their branching fractions at the $\mathcal{O}$(10$^{-7}$) level.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Cross sections of $η$ mesons in $p$$+$$p$ collisions at forward rapidity at $\sqrt{s}=500$ GeV and central rapidity at $\sqrt{s}=510$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
M. Alfred,
D. Anderson,
K. R. Andrews,
A. Angerami,
S. Antsupov,
K. Aoki,
N. Apadula,
E. Appelt,
Y. Aramaki,
R. Armendariz,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun
, et al. (476 additional authors not shown)
Abstract:
We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross sectio…
▽ More
We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross section is measured from 3.5 to 44 GeV/$c$ for pseudorapidity $|η|<0.35$. Both cross sections serve as critical inputs to an updated global analysis of the $η$-meson fragmentation functions.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Jack Unit: An Area- and Energy-Efficient Multiply-Accumulate (MAC) Unit Supporting Diverse Data Formats
Authors:
Seock-Hwan Noh,
Sungju Kim,
Seohyun Kim,
Daehoon Kim,
Jaeha Kung,
Yeseong Kim
Abstract:
In this work, we introduce an area- and energy-efficient multiply-accumulate (MAC) unit, named Jack unit, that is a jack-of-all-trades, supporting various data formats such as integer (INT), floating point (FP), and microscaling data format (MX). It provides bit-level flexibility and enhances hardware efficiency by i) replacing the carry-save multiplier (CSM) in the FP multiplier with a precision-…
▽ More
In this work, we introduce an area- and energy-efficient multiply-accumulate (MAC) unit, named Jack unit, that is a jack-of-all-trades, supporting various data formats such as integer (INT), floating point (FP), and microscaling data format (MX). It provides bit-level flexibility and enhances hardware efficiency by i) replacing the carry-save multiplier (CSM) in the FP multiplier with a precision-scalable CSM, ii) performing the adjustment of significands based on the exponent differences within the CSM, and iii) utilizing 2D sub-word parallelism. To assess effectiveness, we implemented the layout of the Jack unit and three baseline MAC units. Additionally, we designed an AI accelerator equipped with our Jack units to compare with a state-of-the-art AI accelerator supporting various data formats. The proposed MAC unit occupies 1.17~2.01x smaller area and consumes 1.05~1.84x lower power compared to the baseline MAC units. On five AI benchmarks, the accelerator designed with our Jack units improves energy efficiency by 1.32~5.41x over the baseline across various data formats.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction
Authors:
Sungmin Lee,
Minju Kang,
Joonhee Lee,
Seungyong Lee,
Dongju Kim,
Jingi Hong,
Jun Shin,
Pei Zhang,
JeongGil Ko
Abstract:
Question-answering (QA) interfaces powered by large language models (LLMs) present a promising direction for improving interactivity with HVAC system insights, particularly for non-expert users. However, enabling accurate, real-time, and context-aware interactions with HVAC systems introduces unique challenges, including the integration of frequently updated sensor data, domain-specific knowledge…
▽ More
Question-answering (QA) interfaces powered by large language models (LLMs) present a promising direction for improving interactivity with HVAC system insights, particularly for non-expert users. However, enabling accurate, real-time, and context-aware interactions with HVAC systems introduces unique challenges, including the integration of frequently updated sensor data, domain-specific knowledge grounding, and coherent multi-stage reasoning. In this paper, we present JARVIS, a two-stage LLM-based QA framework tailored for sensor data-driven HVAC system interaction. JARVIS employs an Expert-LLM to translate high-level user queries into structured execution instructions, and an Agent that performs SQL-based data retrieval, statistical processing, and final response generation. To address HVAC-specific challenges, JARVIS integrates (1) an adaptive context injection strategy for efficient HVAC and deployment-specific information integration, (2) a parameterized SQL builder and executor to improve data access reliability, and (3) a bottom-up planning scheme to ensure consistency across multi-stage response generation. We evaluate JARVIS using real-world data collected from a commercial HVAC system and a ground truth QA dataset curated by HVAC experts to demonstrate its effectiveness in delivering accurate and interpretable responses across diverse queries. Results show that JARVIS consistently outperforms baseline and ablation variants in both automated and user-centered assessments, achieving high response quality and accuracy.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Low-mass vector-meson production at forward rapidity in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
D. Anderson,
V. Andrieux,
S. Antsupov,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
E. Bannikov,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont
, et al. (331 additional authors not shown)
Abstract:
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nuc…
▽ More
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nucleons, $\langle N_{\rm part}\rangle$, and the transverse momentum $p_T$. These results were compared with those obtained via the kaon decay channel in a similar $p_T$ range at midrapidity. The nuclear-modification factors in both rapidity regions are consistent within the uncertainties. A comparison of the $ω+ρ$ and $J/ψ$ mesons reveals that the light and heavy flavors are consistently suppressed across both $p_T$ and ${\langle}N_{\rm part}\rangle$. In contrast, the $φ$ meson displays a nuclear-modification factor consistent with unity, suggesting strangeness enhancement in the medium formed.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
Graphon particle system with common noise
Authors:
Erhan Bayraktar,
Xihao He,
Donghan Kim
Abstract:
We study a nonlinear graphon particle system driven by both idiosyncratic and common noise, where interactions are governed by a graphon and represented as positive finite measures. Each particle evolves via a McKean-Vlasov-type SDE with graphon-weighted conditional laws. We prove a law of large numbers for the empirical and interaction measures, using generalized Wasserstein metrics and weak conv…
▽ More
We study a nonlinear graphon particle system driven by both idiosyncratic and common noise, where interactions are governed by a graphon and represented as positive finite measures. Each particle evolves via a McKean-Vlasov-type SDE with graphon-weighted conditional laws. We prove a law of large numbers for the empirical and interaction measures, using generalized Wasserstein metrics and weak convergence techniques suited for the non-Markovian structure induced by common noise.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
Causal-Paced Deep Reinforcement Learning
Authors:
Geonwoo Cho,
Jaegyun Im,
Doyoon Kim,
Sundong Kim
Abstract:
Designing effective task sequences is crucial for curriculum reinforcement learning (CRL), where agents must gradually acquire skills by training on intermediate tasks. A key challenge in CRL is to identify tasks that promote exploration, yet are similar enough to support effective transfer. While recent approach suggests comparing tasks via their Structural Causal Models (SCMs), the method requir…
▽ More
Designing effective task sequences is crucial for curriculum reinforcement learning (CRL), where agents must gradually acquire skills by training on intermediate tasks. A key challenge in CRL is to identify tasks that promote exploration, yet are similar enough to support effective transfer. While recent approach suggests comparing tasks via their Structural Causal Models (SCMs), the method requires access to ground-truth causal structures, an unrealistic assumption in most RL settings. In this work, we propose Causal-Paced Deep Reinforcement Learning (CP-DRL), a curriculum learning framework aware of SCM differences between tasks based on interaction data approximation. This signal captures task novelty, which we combine with the agent's learnability, measured by reward gain, to form a unified objective. Empirically, CP-DRL outperforms existing curriculum methods on the Point Mass benchmark, achieving faster convergence and higher returns. CP-DRL demonstrates reduced variance with comparable final returns in the Bipedal Walker-Trivial setting, and achieves the highest average performance in the Infeasible variant. These results indicate that leveraging causal relationships between tasks can improve the structure-awareness and sample efficiency of curriculum reinforcement learning. We provide the full implementation of CP-DRL to facilitate the reproduction of our main results at https://github.com/Cho-Geonwoo/CP-DRL.
△ Less
Submitted 24 June, 2025;
originally announced July 2025.
-
Predictive Control over LAWN: Joint Trajectory Design and Resource Allocation
Authors:
Haijia Jin,
Jun Wu,
Weijie Yuan,
Ruizhi Ruan,
Jiacheng Wang,
Dusit Niyato,
Dong In Kim,
Abbas Jamalipour
Abstract:
Low-altitude wireless networks (LAWNs) have been envisioned as flexible and transformative platforms for enabling delay-sensitive control applications in Internet of Things (IoT) systems. In this work, we investigate the real-time wireless control over a LAWN system, where an aerial drone is employed to serve multiple mobile automated guided vehicles (AGVs) via finite blocklength (FBL) transmissio…
▽ More
Low-altitude wireless networks (LAWNs) have been envisioned as flexible and transformative platforms for enabling delay-sensitive control applications in Internet of Things (IoT) systems. In this work, we investigate the real-time wireless control over a LAWN system, where an aerial drone is employed to serve multiple mobile automated guided vehicles (AGVs) via finite blocklength (FBL) transmission. Toward this end, we adopt the model predictive control (MPC) to ensure accurate trajectory tracking, while we analyze the communication reliability using the outage probability. Subsequently, we formulate an optimization problem to jointly determine control policy, transmit power allocation, and drone trajectory by accounting for the maximum travel distance and control input constraints. To address the resultant non-convex optimization problem, we first derive the closed-form expression of the outage probability under FBL transmission. Based on this, we reformulate the original problem as a quadratic programming (QP) problem, followed by developing an alternating optimization (AO) framework. Specifically, we employ the projected gradient descent (PGD) method and the successive convex approximation (SCA) technique to achieve computationally efficient sub-optimal solutions. Furthermore, we thoroughly analyze the convergence and computational complexity of the proposed algorithm. Extensive simulations and AirSim-based experiments are conducted to validate the superiority of our proposed approach compared to the baseline schemes in terms of control performance.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning
Authors:
Dohoon Kim,
Donghun Kang,
Taesup Moon
Abstract:
Domain-Adaptive Pre-training (DAP) has recently gained attention for its effectiveness in fine-tuning pre-trained models. Building on this, continual DAP has been explored to develop pre-trained models capable of incrementally incorporating different domain datasets. However, existing continual DAP methods face several limitations: (1) high computational cost and GPU memory usage during training;…
▽ More
Domain-Adaptive Pre-training (DAP) has recently gained attention for its effectiveness in fine-tuning pre-trained models. Building on this, continual DAP has been explored to develop pre-trained models capable of incrementally incorporating different domain datasets. However, existing continual DAP methods face several limitations: (1) high computational cost and GPU memory usage during training; (2) sensitivity to incremental data order; and (3) providing a single, generalized model for all end tasks, which contradicts the essence of DAP. In this paper, we propose DoMIX, a novel approach that addresses these challenges by leveraging LoRA modules, a representative parameter-efficient fine-tuning (PEFT) method. Our approach enables efficient and parallel domain-adaptive pre-training that is robust to domain order and effectively utilizes accumulated knowledge to provide tailored pre-trained models for specific tasks. We also demonstrate that our method can be extended beyond the DAP setting to standard LLM fine-tuning scenarios. Code is available at https://github.com/dohoonkim-ai/DoMIX.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning
Authors:
Kuniaki Saito,
Donghyun Kim,
Kwanyong Park,
Atsushi Hashimoto,
Yoshitaka Ushiku
Abstract:
An image captioning model flexibly switching its language pattern, e.g., descriptiveness and length, should be useful since it can be applied to diverse applications. However, despite the dramatic improvement in generative vision-language models, fine-grained control over the properties of generated captions is not easy due to two reasons: (i) existing models are not given the properties as a cond…
▽ More
An image captioning model flexibly switching its language pattern, e.g., descriptiveness and length, should be useful since it can be applied to diverse applications. However, despite the dramatic improvement in generative vision-language models, fine-grained control over the properties of generated captions is not easy due to two reasons: (i) existing models are not given the properties as a condition during training and (ii) existing models cannot smoothly transition its language pattern from one state to the other. Given this challenge, we propose a new approach, CaptionSmiths, to acquire a single captioning model that can handle diverse language patterns. First, our approach quantifies three properties of each caption, length, descriptiveness, and uniqueness of a word, as continuous scalar values, without human annotation. Given the values, we represent the conditioning via interpolation between two endpoint vectors corresponding to the extreme states, e.g., one for a very short caption and one for a very long caption. Empirical results demonstrate that the resulting model can smoothly change the properties of the output captions and show higher lexical alignment than baselines. For instance, CaptionSmiths reduces the error in controlling caption length by 506\% despite better lexical alignment. Code will be available on https://github.com/omron-sinicx/captionsmiths.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Multi-User Generative Semantic Communication with Intent-Aware Semantic-Splitting Multiple Access
Authors:
Jiayi Lu,
Wanting Yang,
Zehui Xiong,
Rahim Tafazolli,
Tony Q. S. Quek,
Mérouane Debbah,
Dong In Kim
Abstract:
With the booming development of generative artificial intelligence (GAI), semantic communication (SemCom) has emerged as a new paradigm for reliable and efficient communication. This paper considers a multi-user downlink SemCom system, using vehicular networks as the representative scenario for multi-user content dissemination. To address diverse yet overlapping user demands, we propose a multi-us…
▽ More
With the booming development of generative artificial intelligence (GAI), semantic communication (SemCom) has emerged as a new paradigm for reliable and efficient communication. This paper considers a multi-user downlink SemCom system, using vehicular networks as the representative scenario for multi-user content dissemination. To address diverse yet overlapping user demands, we propose a multi-user Generative SemCom-enhanced intent-aware semantic-splitting multiple access (SS-MGSC) framework. In the framework, we construct an intent-aware shared knowledge base (SKB) that incorporates prior knowledge of semantic information (SI) and user-specific preferences. Then, we designate the common SI as a one-hot semantic map that is broadcast to all users, while the private SI is delivered as personalized text for each user. On the receiver side, a diffusion model enhanced with ControlNet is adopted to generate high-quality personalized images. To capture both semantic relevance and perceptual similarity, we design a novel semantic efficiency score (SES) metric as the optimization objective. Building on this, we formulate a joint optimization problem for multi-user semantic extraction and beamforming, solved using a reinforcement learning-based algorithm due to its robustness in high-dimensional settings. Simulation results demonstrate the effectiveness of the proposed scheme.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Search for an Axion-Like Particle in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ Decays at Belle
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae
, et al. (400 additional authors not shown)
Abstract:
We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the dec…
▽ More
We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the decay of the axion-like particle into a pair of photons, $a \rightarrow γγ$. We scan the two-photon invariant mass in the range $0.16\ \mathrm{GeV/}c^2-4.50\ \mathrm{GeV}/c^2$ for the $K$ modes and $0.16\ \mathrm{GeV/}c^2-4.20\ \mathrm{GeV}/c^2$ for the $K^{*}$ modes. No significant signal is observed in any of the modes, and 90\% confidence level upper limits are established on the coupling to the $W$ boson, $g_aW$, as a function of $a$ mass. The limits range from $3 \times 10^{-6} \mathrm{GeV}^{-1}$ to $3 \times 10^{-5} \mathrm{GeV}^{-1}$, improving the current constraints on $g_aW$ by a factor of two over the most stringent previous experimental results.
△ Less
Submitted 3 July, 2025; v1 submitted 1 July, 2025;
originally announced July 2025.
-
HST pre-imaging of a free-floating planet candidate microlensing event
Authors:
Mateusz Kapusta,
Przemek Mroz,
Yoon-Hyun Ryu,
Andrzej Udalski,
Szymon Kozlowski,
Sean Terry,
Michal K. Szymanski,
Igor Soszynski,
Pawel Pietrukowicz,
Radoslaw Poleski,
Jan Skowron,
Krzysztof Ulaczyk,
Mariusz Gromadzki,
Krzysztof Rybicki,
Patryk Iwanek,
Marcin Wrona,
Mateusz J. Mróz,
Michael D. Albrow,
Sun-Ju Chung,
Andrew Gould,
Cheongho Han,
Kyu-Ha Hwang,
Youn Kil Jung,
In-Gu Shin,
Yossi Shvartzvald
, et al. (11 additional authors not shown)
Abstract:
High-cadence microlensing observations uncovered a population of very short-timescale microlensing events, which are believed to be caused by the population of free-floating planets (FFP) roaming the Milky Way. Unfortunately, the light curves of such events are indistinguishable from those caused by wide-orbit planets. To properly differentiate both cases, one needs high-resolution observations th…
▽ More
High-cadence microlensing observations uncovered a population of very short-timescale microlensing events, which are believed to be caused by the population of free-floating planets (FFP) roaming the Milky Way. Unfortunately, the light curves of such events are indistinguishable from those caused by wide-orbit planets. To properly differentiate both cases, one needs high-resolution observations that would allow resolving a putative luminous companion to the lens long before or after the event. Usually, the baseline between the event and high-resolution observations needs to be quite long ($\sim 10$ yr), hindering potential follow-up efforts. However, there is a chance to use archival data if they exist. Here, we present an analysis of the microlensing event OGLE-2023-BLG-0524, the site of which was captured in 1997 with the Hubble Space Telescope (HST). Hence, we achieve a record-breaking baseline length of 25 years. A very short duration of the event ($t_E = 0.346 \pm 0.008$ d) indicates an FFP as the explanation. We have not detected any potential companion to the lens with the HST data, which is consistent with the FFP origin of the event. Thanks to the available HST data, we are able to reject from 25% to 48% of potential stellar companions depending on the assumed population model. Based on the finite-source effects in the light curve we measure the angular Einstein radius value $θ_E = 4.78 \pm 0.23 μas$, suggesting a super-Earth in the Galactic disk or a sub-Saturn-mass planet in the Galactic bulge. We show that the archival high-resolution images should be available for several microlensing events, providing us with the unprecedented possibility of seeing the lensing system as it was many years before the event.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
New Constraints on Neutrino-Dark Matter Interactions: A Comprehensive Analysis
Authors:
P. S. Bhupal Dev,
Doojin Kim,
Deepak Sathyan,
Kuver Sinha,
Yongchao Zhang
Abstract:
We present a comprehensive analysis of the interactions of neutrinos with the dark sector within the simplified model framework. We first derive the exact analytic formulas for the differential scattering cross sections of neutrinos with scalar, fermion, and vector dark matter (DM) for light dark sector models with mediators of different types. We then implement the full catalog of constraints on…
▽ More
We present a comprehensive analysis of the interactions of neutrinos with the dark sector within the simplified model framework. We first derive the exact analytic formulas for the differential scattering cross sections of neutrinos with scalar, fermion, and vector dark matter (DM) for light dark sector models with mediators of different types. We then implement the full catalog of constraints on the parameter space of the neutrino-DM and neutrino-mediator couplings and masses, including cosmological and astrophysical bounds coming from Big Bang Nucleosynthesis, Cosmic Microwave Background, DM and neutrino self-interactions, DM collisional damping, and astrophysical neutrino sources, as well as laboratory constraints from 3-body meson decays and invisible $Z$ decays. We find that most of the benchmarks in the DM mass-coupling plane adopted in previous studies to get an observable neutrino-DM interaction effect are actually ruled out by a combination of the above-mentioned constraints, especially the laboratory ones which are robust against astrophysical uncertainties and independent of the cosmological history. To illustrate the consequences of our new results, we take the galactic supernova neutrinos in the MeV energy range as a concrete example and highlight the difficulties in finding any observable effect of neutrino-DM interactions. Finally, we identify new benchmark points potentially promising for future observational prospects of the attenuation of the galactic supernova neutrino flux and comment on their implications for the detection prospects in future large-volume neutrino experiments such as JUNO, Hyper-K, and DUNE. We also comment on the ultraviolet-embedding of the effective neutrino-DM couplings.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Process-aware and high-fidelity microstructure generation using stable diffusion
Authors:
Hoang Cuong Phan,
Minh Tien Tran,
Chihun Lee,
Hoheok Kim,
Sehyok Oh,
Dong-Kyu Kim,
Ho Won Lee
Abstract:
Synthesizing realistic microstructure images conditioned on processing parameters is crucial for understanding process-structure relationships in materials design. However, this task remains challenging due to limited training micrographs and the continuous nature of processing variables. To overcome these challenges, we present a novel process-aware generative modeling approach based on Stable Di…
▽ More
Synthesizing realistic microstructure images conditioned on processing parameters is crucial for understanding process-structure relationships in materials design. However, this task remains challenging due to limited training micrographs and the continuous nature of processing variables. To overcome these challenges, we present a novel process-aware generative modeling approach based on Stable Diffusion 3.5 Large (SD3.5-Large), a state-of-the-art text-to-image diffusion model adapted for microstructure generation. Our method introduces numeric-aware embeddings that encode continuous variables (annealing temperature, time, and magnification) directly into the model's conditioning, enabling controlled image generation under specified process conditions and capturing process-driven microstructural variations. To address data scarcity and computational constraints, we fine-tune only a small fraction of the model's weights via DreamBooth and Low-Rank Adaptation (LoRA), efficiently transferring the pre-trained model to the materials domain. We validate realism using a semantic segmentation model based on a fine-tuned U-Net with a VGG16 encoder on 24 labeled micrographs. It achieves 97.1% accuracy and 85.7% mean IoU, outperforming previous methods. Quantitative analyses using physical descriptors and spatial statistics show strong agreement between synthetic and real microstructures. Specifically, two-point correlation and lineal-path errors remain below 2.1% and 0.6%, respectively. Our method represents the first adaptation of SD3.5-Large for process-aware microstructure generation, offering a scalable approach for data-driven materials design.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
High-Performance Ultra-Wide-Bandgap CaSnO3 Metal-Oxide-Semiconductor Field-Effect Transistors
Authors:
Weideng Sun,
Junghyun Koo,
Donghwan Kim,
Hongseung Lee,
Rishi Raj,
Chengyu Zhu,
Kiyoung Lee,
Andre Mkhoyan,
Hagyoul Bae,
Bharat Jalan,
Gang Qiu
Abstract:
The increasing demand for high-voltage and high-power electronic applications has intensified the search for novel ultrawide bandgap (UWB) semiconductors. Alkaline earth stannates possess wide band gaps and exhibit the highest room-temperature electron mobilities among all perovskite oxides. Among this family, Calcium stannate (CaSnO3) has the largest band gap of ~4.7 eV, holding great promise for…
▽ More
The increasing demand for high-voltage and high-power electronic applications has intensified the search for novel ultrawide bandgap (UWB) semiconductors. Alkaline earth stannates possess wide band gaps and exhibit the highest room-temperature electron mobilities among all perovskite oxides. Among this family, Calcium stannate (CaSnO3) has the largest band gap of ~4.7 eV, holding great promise for high-power applications. However, the demonstration of CaSnO3 power electronic devices is so far limited. In this work, high-performance metal-oxide-semiconductor field-effect transistor (MOSFET) devices based on La-doped CaSnO3 are demonstrated for the first time. The MOSFETs exhibit an on/off ratio exceeding 10^8, along with field-effect mobility of 8.4 cm2 V-1 s-1 and on-state current of 30 mA mm-1. The high performance of the CaSnO3 MOSFET devices can be ascribed to the excellent metal-to-semiconductor contact resistance of 0.73 kΩμm. The devices also show great potential for harsh environment operations, as high-temperature operations up to 400 K have been demonstrated. An off-state breakdown voltage of 1660 V is achieved, with a breakdown field of ~8.3 MV cm-1 among the highest reported for all UWB semiconductors. This work represents significant progress toward realizing the practical application of CaSnO3 in future high-voltage power electronic technologies.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
Oneta: Multi-Style Image Enhancement Using Eigentransformation Functions
Authors:
Jiwon Kim,
Soohyun Hwang,
Dong-O Kim,
Changsu Han,
Min Kyu Park,
Chang-Su Kim
Abstract:
The first algorithm, called Oneta, for a novel task of multi-style image enhancement is proposed in this work. Oneta uses two point operators sequentially: intensity enhancement with a transformation function (TF) and color correction with a color correction matrix (CCM). This two-step enhancement model, though simple, achieves a high performance upper bound. Also, we introduce eigentransformation…
▽ More
The first algorithm, called Oneta, for a novel task of multi-style image enhancement is proposed in this work. Oneta uses two point operators sequentially: intensity enhancement with a transformation function (TF) and color correction with a color correction matrix (CCM). This two-step enhancement model, though simple, achieves a high performance upper bound. Also, we introduce eigentransformation function (eigenTF) to represent TF compactly. The Oneta network comprises Y-Net and C-Net to predict eigenTF and CCM parameters, respectively. To support $K$ styles, Oneta employs $K$ learnable tokens. During training, each style token is learned using image pairs from the corresponding dataset. In testing, Oneta selects one of the $K$ style tokens to enhance an image accordingly. Extensive experiments show that the single Oneta network can effectively undertake six enhancement tasks -- retouching, image signal processing, low-light image enhancement, dehazing, underwater image enhancement, and white balancing -- across 30 datasets.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
MedRegion-CT: Region-Focused Multimodal LLM for Comprehensive 3D CT Report Generation
Authors:
Sunggu Kyung,
Jinyoung Seo,
Hyunseok Lim,
Dongyeong Kim,
Hyungbin Park,
Jimin Sung,
Jihyun Kim,
Wooyoung Jo,
Yoojin Nam,
Namkug Kim
Abstract:
The recent release of RadGenome-Chest CT has significantly advanced CT-based report generation. However, existing methods primarily focus on global features, making it challenging to capture region-specific details, which may cause certain abnormalities to go unnoticed. To address this, we propose MedRegion-CT, a region-focused Multi-Modal Large Language Model (MLLM) framework, featuring three key…
▽ More
The recent release of RadGenome-Chest CT has significantly advanced CT-based report generation. However, existing methods primarily focus on global features, making it challenging to capture region-specific details, which may cause certain abnormalities to go unnoticed. To address this, we propose MedRegion-CT, a region-focused Multi-Modal Large Language Model (MLLM) framework, featuring three key innovations. First, we introduce Region Representative ($R^2$) Token Pooling, which utilizes a 2D-wise pretrained vision model to efficiently extract 3D CT features. This approach generates global tokens representing overall slice features and region tokens highlighting target areas, enabling the MLLM to process comprehensive information effectively. Second, a universal segmentation model generates pseudo-masks, which are then processed by a mask encoder to extract region-centric features. This allows the MLLM to focus on clinically relevant regions, using six predefined region masks. Third, we leverage segmentation results to extract patient-specific attributions, including organ size, diameter, and locations. These are converted into text prompts, enriching the MLLM's understanding of patient-specific contexts. To ensure rigorous evaluation, we conducted benchmark experiments on report generation using the RadGenome-Chest CT. MedRegion-CT achieved state-of-the-art performance, outperforming existing methods in natural language generation quality and clinical relevance while maintaining interpretability. The code for our framework is publicly available.
△ Less
Submitted 29 June, 2025;
originally announced June 2025.
-
VSRM: A Robust Mamba-Based Framework for Video Super-Resolution
Authors:
Dinh Phu Tran,
Dao Duy Hung,
Daeyoung Kim
Abstract:
Video super-resolution remains a major challenge in low-level vision tasks. To date, CNN- and Transformer-based methods have delivered impressive results. However, CNNs are limited by local receptive fields, while Transformers struggle with quadratic complexity, posing challenges for processing long sequences in VSR. Recently, Mamba has drawn attention for its long-sequence modeling, linear comple…
▽ More
Video super-resolution remains a major challenge in low-level vision tasks. To date, CNN- and Transformer-based methods have delivered impressive results. However, CNNs are limited by local receptive fields, while Transformers struggle with quadratic complexity, posing challenges for processing long sequences in VSR. Recently, Mamba has drawn attention for its long-sequence modeling, linear complexity, and large receptive fields. In this work, we propose VSRM, a novel \textbf{V}ideo \textbf{S}uper-\textbf{R}esolution framework that leverages the power of \textbf{M}amba. VSRM introduces Spatial-to-Temporal Mamba and Temporal-to-Spatial Mamba blocks to extract long-range spatio-temporal features and enhance receptive fields efficiently. To better align adjacent frames, we propose Deformable Cross-Mamba Alignment module. This module utilizes a deformable cross-mamba mechanism to make the compensation stage more dynamic and flexible, preventing feature distortions. Finally, we minimize the frequency domain gaps between reconstructed and ground-truth frames by proposing a simple yet effective Frequency Charbonnier-like loss that better preserves high-frequency content and enhances visual quality. Through extensive experiments, VSRM achieves state-of-the-art results on diverse benchmarks, establishing itself as a solid foundation for future research.
△ Less
Submitted 28 June, 2025;
originally announced June 2025.
-
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval
Authors:
Yongchan Chun,
Minhyuk Kim,
Dongjun Kim,
Chanjun Park,
Heuiseok Lim
Abstract:
Automatic Term Extraction (ATE) identifies domain-specific expressions that are crucial for downstream tasks such as machine translation and information retrieval. Although large language models (LLMs) have significantly advanced various NLP tasks, their potential for ATE has scarcely been examined. We propose a retrieval-based prompting strategy that, in the few-shot setting, selects demonstratio…
▽ More
Automatic Term Extraction (ATE) identifies domain-specific expressions that are crucial for downstream tasks such as machine translation and information retrieval. Although large language models (LLMs) have significantly advanced various NLP tasks, their potential for ATE has scarcely been examined. We propose a retrieval-based prompting strategy that, in the few-shot setting, selects demonstrations according to \emph{syntactic} rather than semantic similarity. This syntactic retrieval method is domain-agnostic and provides more reliable guidance for capturing term boundaries. We evaluate the approach in both in-domain and cross-domain settings, analyzing how lexical overlap between the query sentence and its retrieved examples affects performance. Experiments on three specialized ATE benchmarks show that syntactic retrieval improves F1-score. These findings highlight the importance of syntactic cues when adapting LLMs to terminology-extraction tasks.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
KMT-2022-BLG-0086: Another binary-lens binary-source microlensing event
Authors:
Sun-Ju Chung,
Kyu-Ha Hwang,
Jennifer C. Yee,
Andrew Gould,
Ian A. Bond,
Hongjing Yang,
Michael D. Albrow,
Youn Kil Jung,
Cheongho Han,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Weicheng Zang,
Sang-Mok Cha,
Dong-Jin Kim,
Seung-Lee Kim,
Chung-Uk Lee,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Fumio Abe,
David P. Bennett,
Aparna Bhattacharya,
Akihiko Fukui
, et al. (18 additional authors not shown)
Abstract:
We present the analysis of a microlensing event KMT-2022-BLG-0086 of which the overall light curve is not described by a binary-lens single-source (2L1S) model, which suggests the existence of an extra lens or an extra source. We found that the event is best explained by the binary-lens binary-source (2L2S) model, but the 2L2S model is only favored over the triple-lens single-source (3L1S) model b…
▽ More
We present the analysis of a microlensing event KMT-2022-BLG-0086 of which the overall light curve is not described by a binary-lens single-source (2L1S) model, which suggests the existence of an extra lens or an extra source. We found that the event is best explained by the binary-lens binary-source (2L2S) model, but the 2L2S model is only favored over the triple-lens single-source (3L1S) model by $Δχ^{2} \simeq 9$. Although the event has noticeable anomalies around the peak of the light curve, they are not enough covered to constrain the angular Einstein radius $θ_{\rm E}$, thus we only measure the minimum angular Einstein radius $θ_{\rm E,min}$. From the Bayesian analysis, it is found that that the binary lens system is a binary star with masses of $(m_1,m_2)=(0.46^{+0.35}_{-0.25}\, M_\odot, 0.75^{+0.67}_{-0.55}\, M_\odot)$ at a distance of $D_{\rm L}=5.87^{+1.21}_{-1.79}$ kpc, while the triple lens system is a brown dwarf or a massive giant planet in a low-mass binary-star system with masses of $(m_1,m_2,m_3)=(0.43^{+0.41}_{-0.35}\, M_\odot, 0.056^{+0.055}_{-0.047}\, M_\odot, 20.84^{+20.20}_{-17.04}\, M_{\rm J})$ at a distance of $D_{\rm L}=4.06^{+1.39}_{-3.28}$ kpc, indicating a disk lens system. The 2L2S model yields the relative lens-source proper motion of $μ_{\rm rel} \geqslant 4.6\, \rm mas\, yr^{-1}$ that is consistent with the Bayesian result, whereas the 3L1S model yields $μ_{\rm rel} \geqslant 18.9\, \rm mas\, yr^{-1}$, which is more than three times larger than that of a typical disk object of $\sim 6\, \rm mas\, yr^{-1}$ and thus is not consistent with the Bayesian result. This suggests that the event is likely caused by the binary-lens binary-source model.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
Transformer Based Multi-Target Bernoulli Tracking for Maritime Radar
Authors:
Caden Sweeney,
Du Yong Kim,
Branko Ristic,
Brian Cheung
Abstract:
Multi-target tracking in the maritime domain is a challenging problem due to the non-Gaussian and fluctuating characteristics of sea clutter. This article investigates the use of machine learning (ML) to the detection and tracking of low SIR targets in the maritime domain. The proposed method uses a transformer to extract point measurements from range-azimuth maps, before clustering and tracking u…
▽ More
Multi-target tracking in the maritime domain is a challenging problem due to the non-Gaussian and fluctuating characteristics of sea clutter. This article investigates the use of machine learning (ML) to the detection and tracking of low SIR targets in the maritime domain. The proposed method uses a transformer to extract point measurements from range-azimuth maps, before clustering and tracking using the Labelled mulit- Bernoulli (LMB) filter. A measurement driven birth density design based on the transformer attention maps is also developed. The error performance of the transformer based approach is presented and compared with a constant false alarm rate (CFAR) detection technique. The LMB filter is run in two scenarios, an ideal birth approach, and the measurement driven birth approach. Experiments indicate that the transformer based method has superior performance to the CFAR approach for all target scenarios discussed
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression
Authors:
Ainaz Jamshidi,
Dongchan Kim,
Muhammad Arif
Abstract:
Predictive maintenance (PdM) has become a crucial element of modern industrial practice. PdM plays a significant role in operational dependability and cost management by decreasing unforeseen downtime and optimizing asset life cycle management. Machine learning and deep learning have enabled more precise forecasts of equipment failure and remaining useful life (RUL). Although many studies have bee…
▽ More
Predictive maintenance (PdM) has become a crucial element of modern industrial practice. PdM plays a significant role in operational dependability and cost management by decreasing unforeseen downtime and optimizing asset life cycle management. Machine learning and deep learning have enabled more precise forecasts of equipment failure and remaining useful life (RUL). Although many studies have been conducted on PdM, there has not yet been a standalone comparative study between regression- and classification-based approaches. In this review, we look across a range of PdM methodologies, while focusing more strongly on the comparative use of classification and regression methods in prognostics. While regression-based methods typically provide estimates of RUL, classification-based methods present a forecast of the probability of failure across defined time intervals. Through a comprehensive analysis of recent literature, we highlight key advancements, challenges-such as data imbalance and high-dimensional feature spaces-and emerging trends, including hybrid approaches and AI-enabled prognostic systems. This review aims to provide researchers and practitioners with an awareness of the strengths and compromises of various PdM methods and to help identify future research and build more robust, directed adaptive maintenance systems. Future work may include a systematic review of practical aspects such as public datasets, benchmarking platforms, and open-source tools to support the advancement of PdM research.
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
ProCaliper: functional and structural analysis, visualization, and annotation of proteins
Authors:
Jordan C. Rozum,
Hunter Ufford,
Alexandria K. Im,
Tong Zhang,
David D. Pollock,
Doo Nam Kim,
Song Feng
Abstract:
Understanding protein function at the molecular level requires connecting residue-level annotations with physical and structural properties. This can be cumbersome and error-prone when functional annotation, computation of physico-chemical properties, and structure visualization are separated. To address this, we introduce ProCaliper, an open-source Python library for computing and visualizing phy…
▽ More
Understanding protein function at the molecular level requires connecting residue-level annotations with physical and structural properties. This can be cumbersome and error-prone when functional annotation, computation of physico-chemical properties, and structure visualization are separated. To address this, we introduce ProCaliper, an open-source Python library for computing and visualizing physico-chemical properties of proteins. It can retrieve annotation and structure data from UniProt and AlphaFold databases, compute residue-level properties such as charge, solvent accessibility, and protonation state, and interactively visualize the results of these computations along with user-supplied residue-level data. Additionally, ProCaliper incorporates functional and structural information to construct and optionally sparsify networks that encode the distance between residues and/or annotated functional sites or regions. The package ProCaliper and its source code, along with the code used to generate the figures in this manuscript, are freely available at https://github.com/PNNL-Predictive-Phenomics/ProCaliper.
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
MedErr-CT: A Visual Question Answering Benchmark for Identifying and Correcting Errors in CT Reports
Authors:
Sunggu Kyung,
Hyungbin Park,
Jinyoung Seo,
Jimin Sung,
Jihyun Kim,
Dongyeong Kim,
Wooyoung Jo,
Yoojin Nam,
Sangah Park,
Taehee Kwon,
Sang Min Lee,
Namkug Kim
Abstract:
Computed Tomography (CT) plays a crucial role in clinical diagnosis, but the growing demand for CT examinations has raised concerns about diagnostic errors. While Multimodal Large Language Models (MLLMs) demonstrate promising comprehension of medical knowledge, their tendency to produce inaccurate information highlights the need for rigorous validation. However, existing medical visual question an…
▽ More
Computed Tomography (CT) plays a crucial role in clinical diagnosis, but the growing demand for CT examinations has raised concerns about diagnostic errors. While Multimodal Large Language Models (MLLMs) demonstrate promising comprehension of medical knowledge, their tendency to produce inaccurate information highlights the need for rigorous validation. However, existing medical visual question answering (VQA) benchmarks primarily focus on simple visual recognition tasks, lacking clinical relevance and failing to assess expert-level knowledge. We introduce MedErr-CT, a novel benchmark for evaluating medical MLLMs' ability to identify and correct errors in CT reports through a VQA framework. The benchmark includes six error categories - four vision-centric errors (Omission, Insertion, Direction, Size) and two lexical error types (Unit, Typo) - and is organized into three task levels: classification, detection, and correction. Using this benchmark, we quantitatively assess the performance of state-of-the-art 3D medical MLLMs, revealing substantial variation in their capabilities across different error types. Our benchmark contributes to the development of more reliable and clinically applicable MLLMs, ultimately helping reduce diagnostic errors and improve accuracy in clinical practice. The code and datasets are available at https://github.com/babbu3682/MedErr-CT.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
A Novel Analysis Framework for Microstructural Characterization of Ferroelectric Hafnia: Experimental Validation and Application
Authors:
Yoonsang Park,
Jaeduck Jang,
Hyangsook Lee,
Kihong Kim,
Kyooho Jung,
Yunseong Lee,
Jaewoo Lee,
Eunji Yang,
Sanghyun Jo,
Sijung Yoo,
Hyun Jae Lee,
Donghoon Kim,
Duk-Hyun Choe,
Seunggeol Nam
Abstract:
Herein, we present a novel analysis framework for grain size profile of ferroelectric hafnia to tackle critical shortcomings inherent in the current microstructural analysis. We vastly enhanced visibility of grains with ion beam treatment and performed accurate grain segmentation using deep neural network (DNN). By leveraging our new method, we discovered unexpected discrepancies that contradict p…
▽ More
Herein, we present a novel analysis framework for grain size profile of ferroelectric hafnia to tackle critical shortcomings inherent in the current microstructural analysis. We vastly enhanced visibility of grains with ion beam treatment and performed accurate grain segmentation using deep neural network (DNN). By leveraging our new method, we discovered unexpected discrepancies that contradict previous results, such as deposition temperature (Tdep) and post-metallization annealing (PMA) dependence of grain size statistics, prompting us to reassess earlier interpretations. Combining microstructural analysis with electrical tests, we found that grain size reduction had both positive and negative outcomes: it caused significant diminishing of die-to-die variation (~68 % decrease in standard deviation) in coercive field (Ec), while triggering an upsurge in leakage current. These uncovered results signify robustness of our method in characterization of ferroelectric hafnia for in-depth examination of both device variability and reliability.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Object-aware Sound Source Localization via Audio-Visual Scene Understanding
Authors:
Sung Jin Um,
Dongjin Kim,
Sangmin Lee,
Jung Uk Kim
Abstract:
Audio-visual sound source localization task aims to spatially localize sound-making objects within visual scenes by integrating visual and audio cues. However, existing methods struggle with accurately localizing sound-making objects in complex scenes, particularly when visually similar silent objects coexist. This limitation arises primarily from their reliance on simple audio-visual corresponden…
▽ More
Audio-visual sound source localization task aims to spatially localize sound-making objects within visual scenes by integrating visual and audio cues. However, existing methods struggle with accurately localizing sound-making objects in complex scenes, particularly when visually similar silent objects coexist. This limitation arises primarily from their reliance on simple audio-visual correspondence, which does not capture fine-grained semantic differences between sound-making and silent objects. To address these challenges, we propose a novel sound source localization framework leveraging Multimodal Large Language Models (MLLMs) to generate detailed contextual information that explicitly distinguishes between sound-making foreground objects and silent background objects. To effectively integrate this detailed information, we introduce two novel loss functions: Object-aware Contrastive Alignment (OCA) loss and Object Region Isolation (ORI) loss. Extensive experimental results on MUSIC and VGGSound datasets demonstrate the effectiveness of our approach, significantly outperforming existing methods in both single-source and multi-source localization scenarios. Code and generated detailed contextual information are available at: https://github.com/VisualAIKHU/OA-SSL.
△ Less
Submitted 23 June, 2025; v1 submitted 23 June, 2025;
originally announced June 2025.
-
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention
Authors:
Saad Wazir,
Daeyoung Kim
Abstract:
Segmenting biomarkers in medical images is crucial for various biotech applications. Despite advances, Transformer and CNN based methods often struggle with variations in staining and morphology, limiting feature extraction. In medical image segmentation, where datasets often have limited sample availability, recent state-of-the-art (SOTA) methods achieve higher accuracy by leveraging pre-trained…
▽ More
Segmenting biomarkers in medical images is crucial for various biotech applications. Despite advances, Transformer and CNN based methods often struggle with variations in staining and morphology, limiting feature extraction. In medical image segmentation, where datasets often have limited sample availability, recent state-of-the-art (SOTA) methods achieve higher accuracy by leveraging pre-trained encoders, whereas end-to-end methods tend to underperform. This is due to challenges in effectively transferring rich multiscale features from encoders to decoders, as well as limitations in decoder efficiency. To address these issues, we propose an architecture that captures multi-scale local and global contextual information and a novel decoder design, which effectively integrates features from the encoder, emphasizes important channels and regions, and reconstructs spatial dimensions to enhance segmentation accuracy. Our method, compatible with various encoders, outperforms SOTA methods, as demonstrated by experiments on four datasets and ablation studies. Specifically, our method achieves absolute performance gains of 2.76% on MoNuSeg, 3.12% on DSB, 2.87% on Electron Microscopy, and 4.03% on TNBC datasets compared to existing SOTA methods. Code: https://github.com/saadwazir/MCADS-Decoder
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Residual Connection-Enhanced ConvLSTM for Lithium Dendrite Growth Prediction
Authors:
Hosung Lee,
Byeongoh Hwang,
Dasan Kim,
Myungjoo Kang
Abstract:
The growth of lithium dendrites significantly impacts the performance and safety of rechargeable batteries, leading to short circuits and capacity degradation. This study proposes a Residual Connection-Enhanced ConvLSTM model to predict dendrite growth patterns with improved accuracy and computational efficiency. By integrating residual connections into ConvLSTM, the model mitigates the vanishing…
▽ More
The growth of lithium dendrites significantly impacts the performance and safety of rechargeable batteries, leading to short circuits and capacity degradation. This study proposes a Residual Connection-Enhanced ConvLSTM model to predict dendrite growth patterns with improved accuracy and computational efficiency. By integrating residual connections into ConvLSTM, the model mitigates the vanishing gradient problem, enhances feature retention across layers, and effectively captures both localized dendrite growth dynamics and macroscopic battery behavior. The dataset was generated using a phase-field model, simulating dendrite evolution under varying conditions. Experimental results show that the proposed model achieves up to 7% higher accuracy and significantly reduces mean squared error (MSE) compared to conventional ConvLSTM across different voltage conditions (0.1V, 0.3V, 0.5V). This highlights the effectiveness of residual connections in deep spatiotemporal networks for electrochemical system modeling. The proposed approach offers a robust tool for battery diagnostics, potentially aiding in real-time monitoring and optimization of lithium battery performance. Future research can extend this framework to other battery chemistries and integrate it with real-world experimental data for further validation
△ Less
Submitted 21 June, 2025;
originally announced June 2025.
-
Training-free LLM Verification via Recycling Few-shot Examples
Authors:
Dongseok Lee,
Jimyung Hong,
Dongyoung Kim,
Jaehyung Kim
Abstract:
Although LLMs have achieved remarkable performance, the inherent stochasticity of their reasoning process and varying conclusions present significant challenges. Majority voting or Best-of-N with external verification models has been explored to find the most promising solution among multiple LLM outputs. However, these approaches have certain limitations, such as limited applicability or the cost…
▽ More
Although LLMs have achieved remarkable performance, the inherent stochasticity of their reasoning process and varying conclusions present significant challenges. Majority voting or Best-of-N with external verification models has been explored to find the most promising solution among multiple LLM outputs. However, these approaches have certain limitations, such as limited applicability or the cost of an additional training step. To address this problem, we propose a novel and effective framework that Recycles Few-shot examples to verify LLM outputs (Referi). Our key idea is to additionally utilize the given few-shot examples to evaluate the candidate outputs of the target query, not only using them to generate outputs as the conventional few-shot prompting setup. Specifically, Referi evaluates the generated outputs by combining two different scores, designed motivated from Bayes' rule, and subsequently selects the candidate that is both confidently determined and contextually coherent through a few additional LLM inferences. Experiments with three different LLMs and across seven diverse tasks demonstrate that our framework significantly improves the accuracy of LLMs-achieving an average gain of 4.8%-through effective response selection, without additional training.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Transfer-matrix approach to the Blume-Capel model on the triangular lattice
Authors:
Dimitrios Mataragkas,
Alexandros Vasilopoulos,
Nikolaos G. Fytas,
Dong-Hee Kim
Abstract:
We investigate the spin-$1$ Blume-Capel model on an infinite strip of the triangular lattice using the transfer-matrix method combined with a sparse-matrix factorization technique. Through finite-size scaling analysis of numerically exact spectra for strip widths up to $L = 19$, we accurately locate the tricritical point improving upon recent Monte Carlo estimates. In the first-order regime, we ob…
▽ More
We investigate the spin-$1$ Blume-Capel model on an infinite strip of the triangular lattice using the transfer-matrix method combined with a sparse-matrix factorization technique. Through finite-size scaling analysis of numerically exact spectra for strip widths up to $L = 19$, we accurately locate the tricritical point improving upon recent Monte Carlo estimates. In the first-order regime, we observe exponential scaling of the spectral gap, reflecting the linear growth of interfacial tension as the temperature decreases below the tricritical point. Finally, we validate our tricritical point estimate through precise agreement with conformal field theory predictions for the tricritical Ising universality class. Our results underscore the continued utility of the transfer-matrix approach for studying phase transitions in complex lattice models.
△ Less
Submitted 19 June, 2025;
originally announced June 2025.
-
Supernova-Boosted Dark Matter at Large-Volume Neutrino Detectors
Authors:
Badal Bhalla,
Fazlollah Hajkarim,
Doojin Kim,
Kuver Sinha
Abstract:
Core-collapse supernovae, among the universe's most energetic events, offer a novel window into the dark sector by potentially producing a flux of boosted dark matter (BDM). We explore the potential to detect the BDM produced by supernovae with a focus on fermionic dark matter that interacts with the visible sector through a dark gauge boson. We consider the expected BDM flux at Earth, originating…
▽ More
Core-collapse supernovae, among the universe's most energetic events, offer a novel window into the dark sector by potentially producing a flux of boosted dark matter (BDM). We explore the potential to detect the BDM produced by supernovae with a focus on fermionic dark matter that interacts with the visible sector through a dark gauge boson. We consider the expected BDM flux at Earth, originating from both the diffuse background of all galactic supernovae and potentially strong signals from individual nearby events. Focusing on BDM-electron scattering, we project the sensitivity of major current and future large-volume neutrino detectors - DUNE, Hyper-Kamiokande, and JUNO - to this elusive signal. Our results indicate that these experiments can significantly constrain or discover BDM within compelling parameter spaces, with sensitivity notably enhanced during nearby supernova occurrences. We further emphasize the unique multi-messenger opportunity presented by a galactic supernova, where the characteristic time delay between the neutrino burst and the BDM signal arrival could provide powerful evidence and enable probes of dark matter properties.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
CXL-GPU: Pushing GPU Memory Boundaries with the Integration of CXL Technologies
Authors:
Donghyun Gouk,
Seungkwan Kang,
Seungjun Lee,
Jiseon Kim,
Kyungkuk Nam,
Eojin Ryu,
Sangwon Lee,
Dongpyung Kim,
Junhyeok Jang,
Hanyeoreum Bae,
Myoungsoo Jung
Abstract:
This work introduces a GPU storage expansion solution utilizing CXL, featuring a novel GPU system design with multiple CXL root ports for integrating diverse storage media (DRAMs and/or SSDs). We developed and siliconized a custom CXL controller integrated at the hardware RTL level, achieving two-digit nanosecond roundtrip latency, the first in the field. This study also includes speculative read…
▽ More
This work introduces a GPU storage expansion solution utilizing CXL, featuring a novel GPU system design with multiple CXL root ports for integrating diverse storage media (DRAMs and/or SSDs). We developed and siliconized a custom CXL controller integrated at the hardware RTL level, achieving two-digit nanosecond roundtrip latency, the first in the field. This study also includes speculative read and deterministic store mechanisms to efficiently manage read and write operations to hide the endpoint's backend media latency variation. Performance evaluations reveal our approach significantly outperforms existing methods, marking a substantial advancement in GPU storage technology.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
New Physics Opportunities at Neutrino Facilities: BSM Physics at Accelerator, Atmospheric, and Reactor Neutrino Experiments
Authors:
Koun Choi,
Doojin Kim,
Jong-Chul Park,
Seodong Shin,
Pouya Bakhti,
Ki-Young Choi,
Chang Hyon Ha,
Kazumi Hata,
Wooyoung Jang,
Yu Seon Jeong,
Young Ju Ko,
Hyun Su Lee,
Weijun Li,
Yu-Feng Li,
Mehedi Masud,
Kenny C. Y. Ng,
Jungsic Park,
Min-Gwa Park,
Komninos-John Plows,
Meshkat Rajaee,
Eunil Won,
Byeongsu Yang,
Seong Moon Yoo,
Jaehoon Yu,
Seokhoon Yun
Abstract:
Since the discovery of the Higgs boson, the long-standing task at hand in particle physics is the search for new physics beyond the Standard Model, which accounts for only about 5\% of the Universe.
In light of this situation, the neutrino sector has drawn significant attention due to neutrino oscillations, which require physics beyond the Standard Model and have prompted a wide array of active…
▽ More
Since the discovery of the Higgs boson, the long-standing task at hand in particle physics is the search for new physics beyond the Standard Model, which accounts for only about 5\% of the Universe.
In light of this situation, the neutrino sector has drawn significant attention due to neutrino oscillations, which require physics beyond the Standard Model and have prompted a wide array of active and planned experimental programs.
Notably, neutrino facilities offer substantial potential to search for new physics beyond neutrino oscillations, owing to their precision measurement capabilities, diverse experimental configurations, and various neutrino sources.
This paper provides a review of the landscape of new physics that can be probed at current and future neutrino experiments, categorized into laboratory-produced and cosmogenic signals.
We discuss recent experimental results interpreted through the lens of new physics, as well as detailed plans and projected sensitivities of next-generation facilities.
This review is based on presentations from the 4th Workshop on New Physics Opportunities in Neutrino Facilities (NPN 2024), held at IBS in Daejeon, Korea, on June 3-5, 2024.
Particular emphasis is placed on accelerator-based neutrino experiments and a range of neutrino programs in East Asia.
We also outline key tasks necessary to realize the promising new physics opportunities ahead.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Determination of $|V_{cb}|$ using $B\to D\ellν_\ell$ Decays at Belle II
Authors:
Belle II Collaboration,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (385 additional authors not shown)
Abstract:
We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and…
▽ More
We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and $B^+\to \bar D^0(\to K^+π^-)\ell^+ν_\ell$, where $\ell$ denotes either an electron or a muon. Charge conjugation is implied. The second $B$ meson in the $Υ(4S)$ event is not reconstructed explicitly. Using an inclusive reconstruction of the unobserved neutrino momentum, we determine the recoil variable $w=v_B\cdot v_D$, where $v_B$ and $v_D$ are the 4-velocities of the $B$ and $D$ mesons. We measure the total decay branching fractions to be $\mathcal{B}(B^0\to D^-\ell^+ν_\ell)=(2.06 \pm 0.05\,(\mathrm{stat.}) \pm 0.10\,(\mathrm{sys.}))\%$ and $\mathcal{B}(B^+\to\bar D^0\ell^+ν_\ell)=(2.31 \pm 0.04\,(\mathrm{stat.}) \pm 0.09\,(\mathrm{sys.}))\%$. We probe lepton flavor universality by measuring $\mathcal{B}(B\to Deν_e)/\mathcal{B}(B\to Dμν_μ)=1.020 \pm 0.020\,(\mathrm{stat.})\pm 0.022\,(\mathrm{sys.})$. Fitting the partial decay branching fraction as a function of $w$ and using the average of lattice QCD calculations of the $B\to D$ form factor, we obtain $ |V_{cb}|=(39.2\pm 0.4\,(\mathrm{stat.}) \pm 0.6\,(\mathrm{sys.}) \pm 0.5\,(\mathrm{th.})$.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Doppelganger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack
Authors:
Daewon Kang,
YeongHwan Shin,
Doyeon Kim,
Kyu-Hwan Jung,
Meong Hi Son
Abstract:
Since the advent of large language models, prompt engineering now enables the rapid, low-effort creation of diverse autonomous agents that are already in widespread use. Yet this convenience raises urgent concerns about the safety, robustness, and behavioral consistency of the underlying prompts, along with the pressing challenge of preventing those prompts from being exposed to user's attempts. I…
▽ More
Since the advent of large language models, prompt engineering now enables the rapid, low-effort creation of diverse autonomous agents that are already in widespread use. Yet this convenience raises urgent concerns about the safety, robustness, and behavioral consistency of the underlying prompts, along with the pressing challenge of preventing those prompts from being exposed to user's attempts. In this paper, we propose the ''Doppelganger method'' to demonstrate the risk of an agent being hijacked, thereby exposing system instructions and internal information. Next, we define the ''Prompt Alignment Collapse under Adversarial Transfer (PACAT)'' level to evaluate the vulnerability to this adversarial transfer attack. We also propose a ''Caution for Adversarial Transfer (CAT)'' prompt to counter the Doppelganger method. The experimental results demonstrate that the Doppelganger method can compromise the agent's consistency and expose its internal information. In contrast, CAT prompts enable effective defense against this adversarial attack.
△ Less
Submitted 26 June, 2025; v1 submitted 17 June, 2025;
originally announced June 2025.
-
Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse
Authors:
Jinwoo Hwang,
Daeun Kim,
Sangyeop Lee,
Yoonsung Kim,
Guseul Heo,
Hojoon Kim,
Yunseok Jeong,
Tadiwos Meaza,
Eunhyeok Park,
Jeongseob Ahn,
Jongse Park
Abstract:
Recently, Video-Language Models (VideoLMs) have demonstrated remarkable capabilities, offering significant potential for flexible and powerful video query systems. These models typically rely on Vision Transformers (ViTs), which process video frames individually to extract visual embeddings. However, generating embeddings for large-scale videos requires ViT inferencing across numerous frames, posi…
▽ More
Recently, Video-Language Models (VideoLMs) have demonstrated remarkable capabilities, offering significant potential for flexible and powerful video query systems. These models typically rely on Vision Transformers (ViTs), which process video frames individually to extract visual embeddings. However, generating embeddings for large-scale videos requires ViT inferencing across numerous frames, posing a major hurdle to real-world deployment and necessitating solutions for integration into scalable video data management systems. This paper introduces Déjà Vu, a video-language query engine that accelerates ViT-based VideoLMs by reusing computations across consecutive frames. At its core is ReuseViT, a modified ViT model specifically designed for VideoLM tasks, which learns to detect inter-frame reuse opportunities, striking an effective balance between accuracy and reuse. Although ReuseViT significantly reduces computation, these savings do not directly translate into performance gains on GPUs. To overcome this, Déjà Vu integrates memory-compute joint compaction techniques that convert the FLOP savings into tangible performance gains. Evaluations on three VideoLM tasks show that Déjà Vu accelerates embedding generation by up to a 2.64x within a 2% error bound, dramatically enhancing the practicality of VideoLMs for large-scale video analytics.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
A Study on Effective Initial Guess Finding Method Based on Bézier Curves: Orbit Determination Applications
Authors:
Daegyun Choi,
Sungwook Yang,
Henzeh Leeghim,
Donghoon Kim
Abstract:
In celestial mechanics, proper orbits related to missions are obtained by solving two-point boundary value problems. Since a selection method of initial value affects the convergence of the solution, developing an effective method to find an initial guess is required. In this work, Bézier curves, which can describe complicated curves and surfaces, are utilized to find the initial guess. First, the…
▽ More
In celestial mechanics, proper orbits related to missions are obtained by solving two-point boundary value problems. Since a selection method of initial value affects the convergence of the solution, developing an effective method to find an initial guess is required. In this work, Bézier curves, which can describe complicated curves and surfaces, are utilized to find the initial guess. First, the given problems are transformed into Bézier curves forms, and Bézier curves' control points, which can handle the shape of curves, are selected by solving the system of nonlinear equations. Finally, the initial guess is obtained by substituting the calculated control points to Bézier curves. To validate the performance of the proposed method, numerical simulations are conducted with respect to three kinds of orbits, which are from circular to highly elliptical orbit (HEO). The proposed method is compared to the general shooting method. The comparison results show that the initial guess calculated by Bézier curves makes finding the solution more efficient in terms of computational time and iterations. Also, it shows that the proposed method finds the solution for the HEO while the general shooting method fails to find the solution.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
LittleBit: Ultra Low-Bit Quantization via Latent Factorization
Authors:
Banseok Lee,
Dongkyu Kim,
Youngcheon You,
Youngmin Kim
Abstract:
Deploying large language models (LLMs) often faces challenges from substantial memory and computational costs. Quantization offers a solution, yet performance degradation in the sub-1-bit regime remains particularly difficult. This paper introduces LittleBit, a novel method for extreme LLM compression. It targets levels like 0.1 bits per weight (BPW), achieving nearly 31$\times$ memory reduction,…
▽ More
Deploying large language models (LLMs) often faces challenges from substantial memory and computational costs. Quantization offers a solution, yet performance degradation in the sub-1-bit regime remains particularly difficult. This paper introduces LittleBit, a novel method for extreme LLM compression. It targets levels like 0.1 bits per weight (BPW), achieving nearly 31$\times$ memory reduction, e.g., Llama2-13B to under 0.9 GB. LittleBit represents weights in a low-rank form using latent matrix factorization, subsequently binarizing these factors. To counteract information loss from this extreme precision, it integrates a multi-scale compensation mechanism. This includes row, column, and an additional latent dimension that learns per-rank importance. Two key contributions enable effective training: Dual Sign-Value-Independent Decomposition (Dual-SVID) for stable quantization-aware training (QAT) initialization, and integrated Residual Compensation to mitigate errors. Extensive experiments confirm LittleBit's superiority in sub-1-bit quantization: e.g., its 0.1 BPW performance on Llama2-7B surpasses the leading method's 0.7 BPW. This establishes a superior size-performance trade-off, with kernel-level benchmarks indicating potential for a 5$\times$ speedup compared to FP16. LittleBit paves the way for deploying powerful LLMs in resource-constrained environments.
△ Less
Submitted 30 May, 2025;
originally announced June 2025.
-
A valuative approach to the anticanonical minimal model program
Authors:
Sung Rak Choi,
Sungwook Jang,
Donghyeon Kim,
Dae-Won Lee
Abstract:
In this paper, we show that the log canonical threshold of a potentially klt triple can be computed by a quasi-monomial valuation. The notion of potential triples provides a larger and more flexible framework to work with than that of generalized pairs. Our main result can be considered as an extension to the result of Xu on klt pairs. As an application of the main result, we show that we can run…
▽ More
In this paper, we show that the log canonical threshold of a potentially klt triple can be computed by a quasi-monomial valuation. The notion of potential triples provides a larger and more flexible framework to work with than that of generalized pairs. Our main result can be considered as an extension to the result of Xu on klt pairs. As an application of the main result, we show that we can run the MMP on any potentially klt triples and $-(K_X+Δ)$-MMP on the potentially klt pairs.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
On volumes and the generic invariance of Fano type varieties
Authors:
Donghyeon Kim
Abstract:
We demonstrate the generic invariance of the Fano type property in cases where the volumes of anti-canonical divisors of Fano type fibers are a constant over a Zariski-dense subset, or the Fano type fibers are dimension $2$. Additionally, paralleling this theorem, we establish a conjecture by Schwede and Smith under the condition that the volumes of anti-canonical divisors remain constant in the r…
▽ More
We demonstrate the generic invariance of the Fano type property in cases where the volumes of anti-canonical divisors of Fano type fibers are a constant over a Zariski-dense subset, or the Fano type fibers are dimension $2$. Additionally, paralleling this theorem, we establish a conjecture by Schwede and Smith under the condition that the volumes of anti-canonical divisors remain constant in the reduction mod $p$.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark
Authors:
Suyeon Kim,
SeongKu Kang,
Dongwoo Kim,
Jungseul Ok,
Hwanjo Yu
Abstract:
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification tasks but struggle with label noise in real-world data. Existing studies on graph learning with label noise commonly rely on class-dependent label noise, overlooking the complexities of instance-dependent noise and falling short of capturing real-world corruption patterns. We introduce BeGIN (Benchmarkin…
▽ More
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification tasks but struggle with label noise in real-world data. Existing studies on graph learning with label noise commonly rely on class-dependent label noise, overlooking the complexities of instance-dependent noise and falling short of capturing real-world corruption patterns. We introduce BeGIN (Benchmarking for Graphs with Instance-dependent Noise), a new benchmark that provides realistic graph datasets with various noise types and comprehensively evaluates noise-handling strategies across GNN architectures, noisy label detection, and noise-robust learning. To simulate instance-dependent corruptions, BeGIN introduces algorithmic methods and LLM-based simulations. Our experiments reveal the challenges of instance-dependent noise, particularly LLM-based corruption, and underscore the importance of node-specific parameterization to enhance GNN robustness. By comprehensively evaluating noise-handling strategies, BeGIN provides insights into their effectiveness, efficiency, and key performance factors. We expect that BeGIN will serve as a valuable resource for advancing research on label noise in graphs and fostering the development of robust GNN training methods. The code is available at https://github.com/kimsu55/BeGIN.
△ Less
Submitted 16 June, 2025; v1 submitted 14 June, 2025;
originally announced June 2025.
-
From Ground to Sky: Architectures, Applications, and Challenges Shaping Low-Altitude Wireless Networks
Authors:
Weijie Yuan,
Yuanhao Cui,
Jiacheng Wang,
Fan Liu,
Geng Sun,
Tao Xiang,
Jie Xu,
Shi Jin,
Dusit Niyato,
Sinem Coleri,
Sumei Sun,
Shiwen Mao,
Abbas Jamalipour,
Dong In Kim,
Mohamed-Slim Alouini,
Xuemin Shen
Abstract:
In this article, we introduce a novel low-altitude wireless network (LAWN), which is a reconfigurable, three-dimensional (3D) layered architecture. In particular, the LAWN integrates connectivity, sensing, control, and computing across aerial and terrestrial nodes that enable seamless operation in complex, dynamic, and mission-critical environments. Different from the conventional aerial communica…
▽ More
In this article, we introduce a novel low-altitude wireless network (LAWN), which is a reconfigurable, three-dimensional (3D) layered architecture. In particular, the LAWN integrates connectivity, sensing, control, and computing across aerial and terrestrial nodes that enable seamless operation in complex, dynamic, and mission-critical environments. Different from the conventional aerial communication systems, LAWN's distinctive feature is its tight integration of functional planes in which multiple functionalities continually reshape themselves to operate safely and efficiently in the low-altitude sky. With the LAWN, we discuss several enabling technologies, such as integrated sensing and communication (ISAC), semantic communication, and fully-actuated control systems. Finally, we identify potential applications and key cross-layer challenges. This article offers a comprehensive roadmap for future research and development in the low-altitude airspace.
△ Less
Submitted 16 June, 2025; v1 submitted 13 June, 2025;
originally announced June 2025.
-
Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection
Authors:
Tae-Seong Han,
Jae-Wook Heo,
Hakseung Kim,
Cheol-Hui Lee,
Hyub Huh,
Eue-Keun Choi,
Dong-Joo Kim
Abstract:
Electrocardiography (ECG) signals are often degraded by noise, which complicates diagnosis in clinical and wearable settings. This study proposes a diffusion-based framework for ECG noise quantification via reconstruction-based anomaly detection, addressing annotation inconsistencies and the limited generalizability of conventional methods. We introduce a distributional evaluation using the Wasser…
▽ More
Electrocardiography (ECG) signals are often degraded by noise, which complicates diagnosis in clinical and wearable settings. This study proposes a diffusion-based framework for ECG noise quantification via reconstruction-based anomaly detection, addressing annotation inconsistencies and the limited generalizability of conventional methods. We introduce a distributional evaluation using the Wasserstein-1 distance ($W_1$), comparing the reconstruction error distributions between clean and noisy ECGs to mitigate inconsistent annotations. Our final model achieved robust noise quantification using only three reverse diffusion steps. The model recorded a macro-average $W_1$ score of 1.308 across the benchmarks, outperforming the next-best method by over 48%. External validations demonstrated strong generalizability, supporting the exclusion of low-quality segments to enhance diagnostic accuracy and enable timely clinical responses to signal degradation. The proposed method enhances clinical decision-making, diagnostic accuracy, and real-time ECG monitoring capabilities, supporting future advancements in clinical and wearable ECG applications.
△ Less
Submitted 13 June, 2025;
originally announced June 2025.
-
Collaborative LLM Inference via Planning for Efficient Reasoning
Authors:
Byeongchan Lee,
Jonghoon Lee,
Dongyoung Kim,
Jaehyung Kim,
Jinwoo Shin
Abstract:
Large language models (LLMs) excel at complex reasoning tasks, but those with strong capabilities (e.g., whose numbers of parameters are larger than 100B) are often accessible only through paid APIs, making them too costly for applications of frequent use. In contrast, smaller open-sourced LLMs (e.g., whose numbers of parameters are less than 3B) are freely available and easy to deploy locally (e.…
▽ More
Large language models (LLMs) excel at complex reasoning tasks, but those with strong capabilities (e.g., whose numbers of parameters are larger than 100B) are often accessible only through paid APIs, making them too costly for applications of frequent use. In contrast, smaller open-sourced LLMs (e.g., whose numbers of parameters are less than 3B) are freely available and easy to deploy locally (e.g., under a single GPU having 8G VRAM), but lack suff icient reasoning ability. This trade-off raises a natural question: can small (free) and large (costly) models collaborate at test time to combine their strengths? We propose a test-time collaboration framework in which a planner model first generates a plan, defined as a distilled and high-level abstraction of the problem.
This plan serves as a lightweight intermediate that guides a reasoner model, which generates a complete solution. Small and large models take turns acting as planner and reasoner, exchanging plans in a multi-round cascade to collaboratively solve complex tasks. Our method achieves accuracy comparable to strong proprietary models alone, while significantly reducing reliance on paid inference. These results highlight planning as an effective prior for orchestrating cost-aware, cross-model inference under real-world deployment constraints.
△ Less
Submitted 13 June, 2025;
originally announced June 2025.
-
Debiasing Online Preference Learning via Preference Feature Preservation
Authors:
Dongyoung Kim,
Jinsung Yoon,
Jinwoo Shin,
Jaehyung Kim
Abstract:
Recent preference learning frameworks for large language models (LLMs) simplify human preferences with binary pairwise comparisons and scalar rewards. This simplification could make LLMs' responses biased to mostly preferred features, and would be exacerbated during the iterations of online preference learning steps. To address these challenges, we propose a novel framework coined PFP (Preference…
▽ More
Recent preference learning frameworks for large language models (LLMs) simplify human preferences with binary pairwise comparisons and scalar rewards. This simplification could make LLMs' responses biased to mostly preferred features, and would be exacerbated during the iterations of online preference learning steps. To address these challenges, we propose a novel framework coined PFP (Preference Feature Preservation). The key idea of PFP is maintaining the distribution of human preference features and utilizing such rich signals throughout the online preference learning process. Specifically, PFP first extract preference features from offline pairwise human preference data and trains a feature classifier. Then, using trained classifier and the distribution preserving optimization, PFP maps appropriate preference features for a new input instruction during online learning. Lastly, PFP trains LLM using the existing preference learning method, by incorporating the preference feature into system prompts and enabling LLM to explicitly handle various human preferences. Our experiments demonstrate that PFP successfully mitigates the bias in preference features during online learning, and hence achieves superior performance compared to previous preference learning methods on standard benchmarks to evaluate LLM alignment.
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
TexTailor: Customized Text-aligned Texturing via Effective Resampling
Authors:
Suin Lee,
Dae-Shik Kim
Abstract:
We present TexTailor, a novel method for generating consistent object textures from textual descriptions. Existing text-to-texture synthesis approaches utilize depth-aware diffusion models to progressively generate images and synthesize textures across predefined multiple viewpoints. However, these approaches lead to a gradual shift in texture properties across viewpoints due to (1) insufficient i…
▽ More
We present TexTailor, a novel method for generating consistent object textures from textual descriptions. Existing text-to-texture synthesis approaches utilize depth-aware diffusion models to progressively generate images and synthesize textures across predefined multiple viewpoints. However, these approaches lead to a gradual shift in texture properties across viewpoints due to (1) insufficient integration of previously synthesized textures at each viewpoint during the diffusion process and (2) the autoregressive nature of the texture synthesis process. Moreover, the predefined selection of camera positions, which does not account for the object's geometry, limits the effective use of texture information synthesized from different viewpoints, ultimately degrading overall texture consistency. In TexTailor, we address these issues by (1) applying a resampling scheme that repeatedly integrates information from previously synthesized textures within the diffusion process, and (2) fine-tuning a depth-aware diffusion model on these resampled textures. During this process, we observed that using only a few training images restricts the model's original ability to generate high-fidelity images aligned with the conditioning, and therefore propose an performance preservation loss to mitigate this issue. Additionally, we improve the synthesis of view-consistent textures by adaptively adjusting camera positions based on the object's geometry. Experiments on a subset of the Objaverse dataset and the ShapeNet car dataset demonstrate that TexTailor outperforms state-of-the-art methods in synthesizing view-consistent textures. The source code for TexTailor is available at https://github.com/Adios42/Textailor
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Optimizing brightness of SPDC source in Laguerre-Gaussian modes using type-0 periodically-poled nonlinear crystal
Authors:
Jungmo Lee,
Kyungdeuk Park,
Dongkyu Kim,
Yonggi Jo,
Dong-Gil Im,
Yong Sup Ihn
Abstract:
Photon pairs generated via spontaneous parametric down-conversion (SPDC) can exhibit entanglement in the Laguerre-Gaussian (LG) mode basis, which enables high-dimensional free-space quantum communication by exploiting the high-dimensional space spanned by the LG modes. For such free-space quantum communication, the brightness of the quantum light source plays an important role due to the atmospher…
▽ More
Photon pairs generated via spontaneous parametric down-conversion (SPDC) can exhibit entanglement in the Laguerre-Gaussian (LG) mode basis, which enables high-dimensional free-space quantum communication by exploiting the high-dimensional space spanned by the LG modes. For such free-space quantum communication, the brightness of the quantum light source plays an important role due to the atmospheric turbulence and photon loss. A variety of studies have analyzed the SPDC brightness by decomposing biphoton states into LG modes, but they have often relied on a degenerate state, a narrow spectral bandwidth approximation, or a thin crystal approximation. However, these approaches are unsuitable for non-degenerate type-0 SPDC with a periodicallypoled nonlinear crystal, which offers higher brightness due to its superior nonlinear coefficients. In this study, we examine the spectrum of photon pairs in specific LG modes generated by a type-0 ppKTP crystal whileavoiding the constraints imposed by the aforementioned assumptions. In addition, we investigate the optimal focal parameters of the pump, signal, and idler to maximize the brightness for a given LG mode. Our findings show that it is not feasible to simultaneously optimize the brightness for different LG modes with a single pump focal parameter. The results of this study provide a comprehensive framework for developing highbrightness quantum light sources and contribute to the advancement of high-dimensional free-space quantum communication.
△ Less
Submitted 1 July, 2025; v1 submitted 12 June, 2025;
originally announced June 2025.
-
Single Cu Atom Sites on Co3O4 Activate Interfacial Oxygen for Enhanced Reactivity and Selective Gas Sensing at Low Temperature
Authors:
Hamin Shin,
Matteo D'Andria,
Jaehyun Ko,
Dong-Ha Kim,
Frank Krumeich,
Andreas T. Guentner
Abstract:
Controlling the redox landscape of transition metal oxides is central to advancing their reactivity for heterogeneous catalysis or high-performance gas sensing. Here we report single Cu atom sites (1.42 wt%) anchored on Co3O4 nanoparticles (Cu1-Co3O4) that dramatically enhance reactivity and molecular sensing properties of the support at low temperature. The Cu1 are identified by X-ray adsorption…
▽ More
Controlling the redox landscape of transition metal oxides is central to advancing their reactivity for heterogeneous catalysis or high-performance gas sensing. Here we report single Cu atom sites (1.42 wt%) anchored on Co3O4 nanoparticles (Cu1-Co3O4) that dramatically enhance reactivity and molecular sensing properties of the support at low temperature. The Cu1 are identified by X-ray adsorption near edge structure and feature strong metal-support interaction between Cu2+ and Co3O4, as revealed by X-ray photoelectron spectroscopy. The ability of Cu1 to form interfacial Cu-O-Co linkages strongly reduces the temperature of lattice oxygen activation compared to CuO nanoparticles on Co3O4 (CuONP-Co3O4), as demonstrated by temperature-programmed reduction and desorption analyses. To demonstrate immediate practical impact, we deploy such Cu1-Co3O4 nanoparticles as chemoresistive sensor for formaldehyde vapor that yields more than an order of magnitude higher response than CuONP-Co3O4 and consistently outperforms state-of-the-art sensors. That way, formaldehyde is detected down to 5 parts-per-billion at 50% relative humidity and 75 °C with excellent selectivity over various critical interferents. These results establish a mechanistic platform for activating redox-active supports using single-atom isolates of non-noble nature that yield drastically enhanced and well-defined reactivity to promote low-temperature oxidation reactions and selective analyte sensing.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.