-
Eigen-Value: Efficient Domain-Robust Data Valuation via Eigenvalue-Based Approach
Authors:
Youngjun Choi,
Joonseong Kang,
Sungjun Lim,
Kyungwoo Song
Abstract:
Data valuation has become central in the era of data-centric AI. It drives efficient training pipelines and enables objective pricing in data markets by assigning a numeric value to each data point. Most existing data valuation methods estimate the effect of removing individual data points by evaluating changes in model validation performance under in-distribution (ID) settings, as opposed to out-of-distribution (OOD) scenarios where data follow different patterns. Since ID and OOD data behave differently, data valuation methods based on ID loss often fail to generalize to OOD settings, particularly when the validation set contains no OOD data. Furthermore, although OOD-aware methods exist, they involve heavy computational costs, which hinder practical deployment. To address these challenges, we introduce \emph{Eigen-Value} (EV), a plug-and-play data valuation framework for OOD robustness that uses only an ID data subset, including during validation. EV provides a new spectral approximation of domain discrepancy, i.e., the gap in loss between ID and OOD data, using ratios of eigenvalues of the ID data's covariance matrix. EV then estimates the marginal contribution of each data point to this discrepancy via perturbation theory, alleviating the computational burden. Subsequently, EV plugs into ID loss-based methods by adding an EV term without any additional training loop. We demonstrate that EV achieves improved OOD robustness and stable value rankings across real-world datasets, while remaining computationally lightweight. These results indicate that EV is practical for large-scale settings with domain shift, offering an efficient path to OOD-robust data valuation.
Submitted 27 October, 2025;
originally announced October 2025.
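The Eigen-Value abstract above refers to estimating each data point's marginal contribution through perturbation theory on the eigenvalues of the ID covariance matrix. The sketch below illustrates only the generic first-order idea, not the authors' EV estimator: for a symmetric matrix with eigenpairs $(\lambda_i, v_i)$, a small symmetric change $\delta\Sigma$ shifts $\lambda_i$ by approximately $v_i^\top \delta\Sigma\, v_i$. The function names, the synthetic data, and the choice of a leave-one-out covariance change are all assumptions made for illustration.

```python
import numpy as np

def first_order_eig_shift(cov, delta):
    """Return eigenvalues of cov and their first-order shifts under cov + delta."""
    eigvals, eigvecs = np.linalg.eigh(cov)
    # For each eigenvector v_i (column i), compute v_i^T @ delta @ v_i.
    shifts = np.einsum("ji,jk,ki->i", eigvecs, delta, eigvecs)
    return eigvals, shifts

def leave_one_out_delta(X, idx):
    """Change in the (biased) sample covariance when row idx is dropped (recomputed exactly)."""
    full = np.cov(X, rowvar=False, bias=True)
    reduced = np.cov(np.delete(X, idx, axis=0), rowvar=False, bias=True)
    return reduced - full

# Synthetic data with well-separated variances (an assumption, for a clean demo).
rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 4)) * np.array([3.0, 2.0, 1.0, 0.5])

cov = np.cov(X, rowvar=False, bias=True)
delta = leave_one_out_delta(X, idx=0)

eigvals, shifts = first_order_eig_shift(cov, delta)
approx = eigvals + shifts                      # perturbative estimate
exact = np.linalg.eigvalsh(cov + delta)        # full re-decomposition
max_err = float(np.max(np.abs(approx - exact)))
```

The point of the approximation is cost: the eigendecomposition is computed once, and each data point then needs only a few matrix-vector products instead of a fresh decomposition. The first-order formula degrades when eigenvalues are nearly degenerate, so any real use would need to check the spectral gaps.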
-
VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting
Authors:
Hoonhee Cho,
Jae-Young Kang,
Giwon Lee,
Hyemin Yang,
Heejun Park,
Seokwoo Jung,
Kuk-Jin Yoon
Abstract:
End-to-end autonomous driving (E2E-AD) has emerged as a promising paradigm that unifies perception, prediction, and planning into a holistic, data-driven framework. However, achieving robustness to varying camera viewpoints, a common real-world challenge due to diverse vehicle configurations, remains an open problem. In this work, we propose VR-Drive, a novel E2E-AD framework that addresses viewpoint generalization by jointly learning 3D scene reconstruction as an auxiliary task to enable planning-aware view synthesis. Unlike prior scene-specific synthesis approaches, VR-Drive adopts a feed-forward inference strategy that supports online training-time augmentation from sparse views without additional annotations. To further improve viewpoint consistency, we introduce a viewpoint-mixed memory bank that facilitates temporal interaction across multiple viewpoints and a viewpoint-consistent distillation strategy that transfers knowledge from original to synthesized views. Trained in a fully end-to-end manner, VR-Drive effectively mitigates synthesis-induced noise and improves planning under viewpoint shifts. In addition, we release a new benchmark dataset to evaluate E2E-AD performance under novel camera viewpoints, enabling comprehensive analysis. Our results demonstrate that VR-Drive is a scalable and robust solution for the real-world deployment of end-to-end autonomous driving systems.
Submitted 27 October, 2025;
originally announced October 2025.
-
Mind the Gap -- Imaging Buried Interfaces in Twisted Oxide Moirés
Authors:
Harikrishnan KP,
Xin Wei,
Chia-Hao Lee,
Dasol Yoon,
Yonghun Lee,
Kevin J. Crust,
Yu-Tsun Shao,
Ruijuan Xu,
Jong-Hoon Kang,
Ce Liang,
Jiwoong Park,
Harold Y. Hwang,
David A. Muller
Abstract:
The ability to tune electronic structure in twisted stacks of layered, two-dimensional (2D) materials has motivated the exploration of similar moiré physics with stacks of twisted oxide membranes. Due to the intrinsic three-dimensional (3D) nature of bonding in many oxides, achieving atomic-level coupling is significantly more challenging than in 2D van der Waals materials. Although clean interfaces with atomic-level proximity have been demonstrated in ceramic bicrystals using high-temperature and high-pressure processing to facilitate atomic diffusion that flattens rough interfaces, such conditions are not readily accessible when bonding oxide membranes. This study shows how topographic mismatch due to surface roughness of the membranes can restrict atomic-scale proximity at the interface to isolated patches even after obvious issues of contaminants and amorphous interlayers are eliminated. In hybrid interfaces between a chemically inert 2D material and an oxide membrane, the reduced ability of the 2D material to conform to the membrane's step-terrace topography also limits atomic-scale contact. In all these material systems, the interface morphology is best characterized using cross-sectional imaging, which is necessary to corroborate investigations of interlayer coupling. When imaging the bicrystal in projection, conventional through-focal imaging is found to be relatively insensitive to the buried interface, whereas electron ptychography reliably resolves structural variations on the order of a nanometer. These findings highlight interface roughness as a key challenge for the field of oxide twistronics and emphasize the need for reliable characterization methods.
Submitted 27 October, 2025;
originally announced October 2025.
-
Discovery of multi-temperature coronal mass ejection signatures from a young solar analogue
Authors:
Kosuke Namekata,
Kevin France,
Jongchul Chae,
Vladimir S. Airapetian,
Adam Kowalski,
Yuta Notsu,
Peter R. Young,
Satoshi Honda,
Soosang Kang,
Juhyung Kang,
Kyeore Lee,
Hiroyuki Maehara,
Kyoung-Sun Lee,
Cole Tamburri,
Tomohito Ohshima,
Masaki Takayama,
Kazunari Shibata
Abstract:
Coronal mass ejections (CMEs) on the early Sun may have profoundly influenced the planetary atmospheres of early Solar System planets. Flaring young solar analogues serve as excellent proxies for probing the plasma environment of the young Sun, yet their CMEs remain poorly understood. Here we report the detection of multi-wavelength Doppler shifts in far-ultraviolet (FUV) and optical lines during a flare on the young solar analogue EK Draconis. During and before a Carrington-class ($\sim$10$^{32}$ erg) flare, warm FUV lines ($\sim$10$^5$ K) exhibit blueshifted emission at 300-550 km s$^{-1}$, indicative of a warm eruption. Ten minutes later, the H$α$ line shows slow (70 km s$^{-1}$), long-lasting ($\gtrsim$2 hrs) blueshifted absorptions, suggesting a cool ($\sim$10$^4$ K) filament eruption. This provides evidence of the multi-temperature and multi-component nature of a stellar CME. If Carrington-class flares/CMEs occurred frequently on the young Sun, they may have cumulatively impacted the early Earth's magnetosphere and atmosphere.
Submitted 24 October, 2025;
originally announced October 2025.
-
Constraints on ultra-heavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory
Authors:
Y. F. Wang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
H. Chen,
Y. H. Chen,
J. P. Cheng,
J. Y. Cui,
W. H. Dai,
Z. Deng,
Y. X. Dong,
C. H. Fang,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
H. X. Huang,
T. C. Huang,
S. Karmakar
, et al. (63 additional authors not shown)
Abstract:
We report a search for ultra-heavy dark matter (UHDM) with the CDEX-10 experiment at the China Jinping Underground Laboratory (CJPL). Using a Monte Carlo framework that incorporates Earth shielding effects, we simulated UHDM propagation and energy deposition in p-type point-contact germanium detectors ($p$PCGe). Analysis of 205.4 kg$\cdot$day exposure in the 0.16-4.16 keVee range showed no excess above background. Our results exclude the spin-independent UHDM-nucleon scattering with two cross section scales, with the UHDM mass from $10^6$ GeV to $10^{11}$ GeV, and provide the most stringent constraints with solid-state detectors below $10^8$ GeV.
Submitted 24 October, 2025;
originally announced October 2025.
-
Combining metal dewetting and lateral etching for the scalable top-down fabrication of GaN nanowire arrays with independently tunable diameter and spacing
Authors:
Jingxuan Kang,
Rose-Mary Jose,
Oliver Brandt,
Lutz Geelhaar
Abstract:
The top-down fabrication of nanowires based on patterning via metal dewetting is a cost-effective and scalable approach that is particularly suited for applications requiring large arrays of nanowires. Advantageously, the nanowire diameter can be tailored by the initial metal film thickness. However, we show here that metal dewetting inherently leads to a coupling between the nanowire diameter and spacing. To overcome this limitation, we introduce two strategies that are exemplified for GaN nanowires: (i) modification of the surface and interface energies within the dewetting system, and (ii) thinning of the nanowires by lateral etching. In the first strategy, GaN(0001), SiOx, and SiNx substrate surfaces are combined with Au, Pt, and Pt-Au alloy dewetting metals to tune the dewetting behavior. The differences in interface energies affect the relation between nanowire diameter and spacing, albeit within a limited range. The second strategy adds a lateral etching step to the conventional top-down nanowire fabrication process. This step at the same time reduces the nanowire diameter and increases the spacing, thus enabling combinations beyond the constraints of metal dewetting alone. When in addition different initial nanowire diameters are employed, it is possible to independently control diameter and spacing over a substantially extended range. Therefore, the inherent limitation of conventional dewetting-based patterning approaches for the top-down fabrication of nanowires is overcome.
Submitted 24 October, 2025;
originally announced October 2025.
-
MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning
Authors:
Siyong Chen,
Jinbo Wen,
Jiawen Kang,
Tenghui Huang,
Xumin Huang,
Yuanjia Su,
Hudan Pan,
Zishao Zhong,
Dusit Niyato,
Shengli Xie,
Dong In Kim
Abstract:
Recently, large models have shown significant potential for smart healthcare. However, the deployment of Large Vision-Language Models (LVLMs) for clinical services is currently hindered by three critical challenges: a tendency to hallucinate answers not grounded in visual evidence, the inefficiency of fixed-depth reasoning, and the difficulty of multi-institutional collaboration. To address these challenges, in this paper, we develop MedAlign, a novel framework to ensure visually accurate LVLM responses for Medical Visual Question Answering (Med-VQA). Specifically, we first propose a multimodal Direct Preference Optimization (mDPO) objective to explicitly align preference learning with visual context. We then design a Retrieval-Aware Mixture-of-Experts (RA-MoE) architecture that utilizes image and text similarity to route queries to a specialized and context-augmented LVLM (i.e., an expert), thereby mitigating hallucinations in LVLMs. To achieve adaptive reasoning and facilitate multi-institutional collaboration, we propose a federated governance mechanism, where the selected expert, fine-tuned on clinical datasets based on mDPO, locally performs iterative Chain-of-Thought (CoT) reasoning via the local meta-cognitive uncertainty estimator. Extensive experiments on three representative Med-VQA datasets demonstrate that MedAlign achieves state-of-the-art performance, outperforming strong retrieval-augmented baselines by up to $11.85\%$ in F1-score, and simultaneously reducing the average reasoning length by $51.60\%$ compared with fixed-depth CoT approaches.
Submitted 23 October, 2025;
originally announced October 2025.
-
Multi-Rate Task-Oriented Communication for Multi-Edge Cooperative Inference
Authors:
Dongwon Kim,
Jiwan Seo,
Joonhyuk Kang
Abstract:
The integration of artificial intelligence (AI) with the Internet of Things (IoT) enables task-oriented communication for multi-edge cooperative inference systems, where edge devices transmit extracted features of local sensory data to an edge server to perform AI-driven tasks. However, privacy concerns and limited communication bandwidth pose fundamental challenges, since simultaneous transmission of extracted features with a single fixed compression ratio from all devices leads to severe inefficiency in communication resource utilization. To address this challenge, we propose a framework that dynamically adjusts the code rate in feature extraction based on its importance to the downstream inference task by adopting a rate-adaptive quantization (RAQ) scheme. Furthermore, to select the code rate for each edge device under a limited bandwidth constraint, a dynamic programming (DP) approach is leveraged to allocate the code rate across discrete code rate options. Experiments on multi-view datasets demonstrate that the proposed framework significantly outperforms frameworks using fixed-rate quantization, achieving a favorable balance between communication efficiency and inference performance under limited bandwidth conditions.
Submitted 27 October, 2025; v1 submitted 22 October, 2025;
originally announced October 2025.
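The multi-rate abstract above describes using dynamic programming to pick one code rate per edge device from a discrete set under a total bandwidth limit. One common way to pose such a problem is as a multiple-choice knapsack, sketched below. This is an assumed formulation, not the paper's algorithm: the integer bandwidth costs, the integer utility scores (standing in for task importance), and the name `allocate_rates` are all hypothetical.

```python
def allocate_rates(options, budget):
    """Pick exactly one (cost, utility) option per device, maximizing total utility.

    options[d] is the list of (integer cost, utility) choices for device d.
    budget is the total integer bandwidth available.
    Returns (best total utility, list of chosen option indices per device).
    """
    NEG = float("-inf")
    # best[b] = best utility over the devices processed so far with total cost exactly b
    best = [NEG] * (budget + 1)
    best[0] = 0
    history = []  # history[d][b] = (option index, previous cost) used to reach cost b
    for opts in options:
        new = [NEG] * (budget + 1)
        pick = [None] * (budget + 1)
        for b in range(budget + 1):
            if best[b] == NEG:
                continue
            for k, (cost, util) in enumerate(opts):
                nb = b + cost
                if nb <= budget and best[b] + util > new[nb]:
                    new[nb] = best[b] + util
                    pick[nb] = (k, b)
        best = new
        history.append(pick)

    b = max(range(budget + 1), key=lambda c: best[c])
    if best[b] == NEG:
        raise ValueError("no feasible allocation within the budget")
    total = best[b]

    # Walk backwards through the recorded choices to recover one option per device.
    picks = []
    for pick in reversed(history):
        k, b = pick[b]
        picks.append(k)
    picks.reverse()
    return total, picks

# Three devices, each with low/medium/high rate options given as (cost, utility).
example_options = [
    [(1, 50), (2, 80), (4, 90)],
    [(1, 20), (2, 70), (4, 95)],
    [(1, 60), (2, 65), (4, 70)],
]
total_utility, chosen = allocate_rates(example_options, budget=6)
```

In this toy instance the medium rate for every device wins: giving any single device the high rate uses up bandwidth that yields more utility when spread across the others. The table has `budget + 1` entries per device, so the running time is proportional to devices × budget × options.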
-
ParaVul: A Parallel Large Language Model and Retrieval-Augmented Framework for Smart Contract Vulnerability Detection
Authors:
Tenghui Huang,
Jinbo Wen,
Jiawen Kang,
Siyong Chen,
Zhengtao Li,
Tao Zhang,
Dongning Liu,
Jiacheng Wang,
Chengjun Cai,
Yinqiu Liu,
Dusit Niyato
Abstract:
Smart contracts play a significant role in automating blockchain services. Nevertheless, vulnerabilities in smart contracts pose serious threats to blockchain security. Currently, traditional detection methods primarily rely on static analysis and formal verification, which can result in high false-positive rates and poor scalability. Large Language Models (LLMs) have recently made significant progress in smart contract vulnerability detection. However, they still face challenges such as high inference costs and substantial computational overhead. In this paper, we propose ParaVul, a parallel LLM and retrieval-augmented framework to improve the reliability and accuracy of smart contract vulnerability detection. Specifically, we first develop Sparse Low-Rank Adaptation (SLoRA) for LLM fine-tuning. SLoRA introduces sparsification by incorporating a sparse matrix into quantized LoRA-based LLMs, thereby reducing computational overhead and resource requirements while enhancing their ability to understand vulnerability-related issues. We then construct a vulnerability contract dataset and develop a hybrid Retrieval-Augmented Generation (RAG) system that integrates dense retrieval with Best Matching 25 (BM25), assisting in verifying the results generated by the LLM. Furthermore, we propose a meta-learning model to fuse the outputs of the RAG system and the LLM, thereby generating the final detection results. After completing vulnerability detection, we design chain-of-thought prompts to guide LLMs to generate comprehensive vulnerability detection reports. Simulation results demonstrate the superiority of ParaVul, especially in terms of F1 scores, achieving 0.9398 for single-label detection and 0.9330 for multi-label detection.
Submitted 19 October, 2025;
originally announced October 2025.
-
A UV to X-Ray View of Soft Excess in Type 1 Active Galactic Nuclei. II. Broadband Correlations
Authors:
Shi-Jiang Chen,
Jun-Xian Wang,
Jia-Lai Kang,
Wen-Yong Kang,
Hao Sou,
Teng Liu,
Zhen-Yi Cai,
Zhen-Bo Su
Abstract:
The physical origin of soft X-ray excess (SE) is a long-standing question, with two prevailing theories -- ``warm corona'' and ``ionized reflection'' -- dominating the discussion. In the warm corona scenario, SE originates from upscattered disk photons and should therefore correlate strongly with UV emission. Conversely, in the ionized reflection scenario, SE arises from the illumination of the accretion disk by the hot corona and should primarily correlate with the hard X-ray primary continuum (PC). In this second paper of the series, we investigate the correlations among SE, UV and PC, leveraging a sample of 59 unobscured type 1 AGNs compiled in \citet{Chen+2025a}. Our extensive analysis reveals a strong intrinsic correlation between SE and UV that remains robust after controlling for PC ($p_\mathrm{null}\lesssim 10^{-7}$). In contrast, the correlation between SE and PC is weaker but still statistically significant ($p_\mathrm{null}\lesssim 5\times 10^{-2}$). These findings suggest that, in addition to ionized reflection -- a natural outcome of the hot corona illuminating the disk -- a warm corona component is essential, and may even dominate, in producing the soft excess. Additionally, we report a mild anti-correlation between SE strength ($q$) and PC photon index ($Γ_\mathrm{PC}$) ($p_\mathrm{null}=10^{-2}$), suggesting a potential competition between the warm and hot coronae. Finally, we find that the $Γ_\mathrm{PC}$ values we derived with SE properly incorporated exhibit a much weaker correlation with $λ_\mathrm{Edd}$ ($p_\mathrm{null}=2\times 10^{-2}$) than previously reported in the literature. This highlights the critical role of accurately modeling SE in studies of the $Γ_\mathrm{PC}$--$λ_\mathrm{Edd}$ relation.
Submitted 18 October, 2025;
originally announced October 2025.
-
Vu's conjecture holds for claw-free graphs
Authors:
Linda Cook,
Ross J. Kang,
Eileen Robinson,
Gabriëlle Zwaneveld
Abstract:
Given a graph $G$, let $Δ_2(G)$ denote the maximum number of neighbors any two distinct vertices of $G$ have in common. Vu (2002) proposed that, provided $Δ_2(G)$ is not too small as a proportion of the maximum degree $Δ(G)$ of $G$, the chromatic number of $G$ should never be too much larger than $Δ_2(G)$. We make a first approach towards Vu's conjecture from a structural graph theoretic point of view. We prove that, in the case where $G$ is claw-free, indeed the chromatic number of $G$ is at most $Δ_2(G)+3$. This is tight, as our bound is met with equality for the line graph of the Petersen graph. Moreover, we can prove this in terms of the more specific parameter that bounds the maximum number of neighbors any two endpoints of some edge of $G$ have in common. Our result may be viewed as a generalization of the classic bound of Vizing (1964) for edge-coloring.
Submitted 17 October, 2025;
originally announced October 2025.
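The claw-free abstract above states that the bound $χ(G) \le Δ_2(G)+3$ is met with equality for the line graph of the Petersen graph. Since that graph has only 15 vertices, the claim can be checked by brute force. The sketch below is a sanity check on that single instance and says nothing about the proof; the helper names are hypothetical, and the Petersen graph is built from the usual outer cycle, spokes, and inner pentagram.

```python
from itertools import combinations

def petersen_edges():
    """Edges of the Petersen graph on vertices 0..9."""
    outer = [(i, (i + 1) % 5) for i in range(5)]
    spokes = [(i, i + 5) for i in range(5)]
    inner = [(5 + i, 5 + (i + 2) % 5) for i in range(5)]
    return [tuple(sorted(e)) for e in outer + spokes + inner]

def line_graph(edges):
    """Adjacency sets of the line graph: two edges are adjacent if they share an endpoint."""
    adj = {e: set() for e in edges}
    for e, f in combinations(edges, 2):
        if set(e) & set(f):
            adj[e].add(f)
            adj[f].add(e)
    return adj

def max_degree(adj):
    return max(len(nbrs) for nbrs in adj.values())

def max_common_neighbors(adj):
    """Delta_2: the largest number of common neighbors over all pairs of distinct vertices."""
    return max(len(adj[u] & adj[v]) for u, v in combinations(adj, 2))

def is_k_colorable(adj, k):
    """Backtracking search for a proper vertex coloring with k colors."""
    order = list(adj)
    color = {}

    def extend(i):
        if i == len(order):
            return True
        v = order[i]
        used = {color[u] for u in adj[v] if u in color}
        for c in range(k):
            if c not in used:
                color[v] = c
                if extend(i + 1):
                    return True
                del color[v]
        return False

    return extend(0)

L = line_graph(petersen_edges())
delta = max_degree(L)
delta2 = max_common_neighbors(L)
chi = next(k for k in range(1, len(L) + 1) if is_k_colorable(L, k))
```

Here `chi` is the chromatic number of the line graph, which equals the chromatic index of the Petersen graph itself. The exhaustive coloring search is only practical because the graph is tiny.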
-
DeepAries: Adaptive Rebalancing Interval Selection for Enhanced Portfolio Selection
Authors:
Jinkyu Kim,
Hyunjung Yi,
Mogan Gim,
Donghee Choi,
Jaewoo Kang
Abstract:
We propose DeepAries, a novel deep reinforcement learning framework for dynamic portfolio management that jointly optimizes the timing and allocation of rebalancing decisions. Unlike prior reinforcement learning methods that employ fixed rebalancing intervals regardless of market conditions, DeepAries adaptively selects optimal rebalancing intervals along with portfolio weights to reduce unnecessary transaction costs and maximize risk-adjusted returns. Our framework integrates a Transformer-based state encoder, which effectively captures complex long-term market dependencies, with Proximal Policy Optimization (PPO) to generate simultaneous discrete (rebalancing intervals) and continuous (asset allocations) actions. Extensive experiments on multiple real-world financial markets demonstrate that DeepAries significantly outperforms traditional fixed-frequency and full-rebalancing strategies in terms of risk-adjusted returns, transaction costs, and drawdowns. Additionally, we provide a live demo of DeepAries at https://deep-aries.github.io/, along with the source code and dataset at https://github.com/dmis-lab/DeepAries, illustrating DeepAries' capability to produce interpretable rebalancing and allocation decisions aligned with shifting market regimes. Overall, DeepAries introduces an innovative paradigm for adaptive and practical portfolio management by integrating both timing and allocation into a unified decision-making process.
Submitted 11 September, 2025;
originally announced October 2025.
-
Improved Absolute Polarization Calibrator for BICEP CMB Polarimeters
Authors:
A. R. Polish,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
C. A. Bischoff,
D. Beck,
J. J. Bock,
H. Boenish,
V. Buza,
B. Cantrall,
J. R. Cheshire IV,
J. Connors,
J. Cornelison,
M. Crumrine,
A. J. Cukierman,
E. Denison,
L. Duband,
M. Echter,
M. Eiben,
B. D. Elwood,
S. Fatigoni,
J. P. Filippini,
A. Fortes
, et al. (67 additional authors not shown)
Abstract:
Cosmic birefringence is a hypothesized parity violation in electromagnetism that predicts a frequency-independent polarization rotation as light propagates. This would rotate the light from the Cosmic Microwave Background, producing an unexpected EB correlation. However, the cosmic birefringence angle is degenerate with the instrument polarization angle, and breaking this degeneracy requires an absolute polarization calibration. We calibrate the BICEP3 telescope (a 95 GHz CMB polarimeter) by observing a rotating polarized source (RPS) with both the telescope and a small test receiver called the In-Situ Absolute Angle Calibrator (ISAAC).
Submitted 14 October, 2025;
originally announced October 2025.
-
Simultaneous Calibration of Noise Covariance and Kinematics for State Estimation of Legged Robots via Bi-level Optimization
Authors:
Denglin Cheng,
Jiarong Kang,
Xiaobin Xiong
Abstract:
Accurate state estimation is critical for legged and aerial robots operating in dynamic, uncertain environments. A key challenge lies in specifying process and measurement noise covariances, which are typically unknown or manually tuned. In this work, we introduce a bi-level optimization framework that jointly calibrates covariance matrices and kinematic parameters in an estimator-in-the-loop manner. The upper level treats noise covariances and model parameters as optimization variables, while the lower level executes a full-information estimator. Differentiating through the estimator allows direct optimization of trajectory-level objectives, resulting in accurate and consistent state estimates. We validate our approach on quadrupedal and humanoid robots, demonstrating significantly improved estimation accuracy and uncertainty calibration compared to hand-tuned baselines. Our method unifies state estimation, sensor, and kinematics calibration into a principled, data-driven framework applicable across diverse robotic platforms.
Submitted 13 October, 2025;
originally announced October 2025.
-
Smart Contract-Enabled Procurement under Bounded Demand Variability: A Truncated Normal Approach
Authors:
Jinho Cha,
Youngchul Kim,
Junyeol Ryu,
Sangjun Park,
Jeongho Kang,
Hyeyoung Hwang
Abstract:
This study develops a strategic procurement framework integrating blockchain-based smart contracts with bounded demand variability modeled through a truncated normal distribution. While existing research emphasizes the technical feasibility of smart contracts, the operational and economic implications of adoption under moderate uncertainty remain underexplored. We propose a multi-supplier model in which a centralized retailer jointly determines the optimal smart contract adoption intensity and supplier allocation decisions. The formulation endogenizes adoption costs, supplier digital readiness, and inventory penalties to capture realistic trade-offs among efficiency, sustainability, and profitability. Analytical results establish concavity and provide closed-form comparative statics for adoption thresholds and procurement quantities. Extensive numerical experiments demonstrate that moderate demand variability supports partial adoption strategies, whereas excessive investment in digital infrastructure can reduce overall profitability. Dynamic simulations further reveal how adaptive learning and declining implementation costs progressively enhance adoption intensity and supply chain performance. The findings provide theoretical and managerial insights for balancing digital transformation, resilience, and sustainability objectives in smart contract-enabled procurement.
Submitted 9 October, 2025;
originally announced October 2025.
-
Constraints on inelastic dark matter from the CDEX-1B experiment
Authors:
Y. F. Liang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
H. Chen,
Y. H. Chen,
J. P. Cheng,
J. Y. Cui,
W. H. Dai,
Z. Deng,
Y. X. Dong,
C. H. Fang,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
H. X. Huang,
T. C. Huang,
S. Karmakar
, et al. (63 additional authors not shown)
Abstract:
We present limits on spin-independent inelastic WIMP-nucleus scattering using the 737.1 kg $\cdot$ day dataset from the CDEX-1B experiment. Expected nuclear recoil spectra for various inelastic WIMP masses $m_χ$ and mass splittings $δ$ are calculated under the standard halo model. An accurate background model of CDEX-1B is constructed by simulating all major background sources. The model parameters are then determined through maximum likelihood estimation and Markov Chain Monte Carlo fitting. The resulting 90\% confidence level upper limits on the WIMP-nucleon cross section $σ_{\mathrm{n}}$ exclude certain DAMA/LIBRA allowed regions: the $χ^2 < 4$ regions for $δ< 30$ keV at $m_χ= 250$ GeV and the $χ^2 < 9$ region for $δ< 50$ keV at $m_χ= 500$ GeV. The method is applicable to other inelastic dark matter scenarios, and the upcoming CDEX-50 experiment is expected to improve sensitivity by four orders of magnitude.
Submitted 9 October, 2025;
originally announced October 2025.
-
LMM-Incentive: Large Multimodal Model-based Incentive Design for User-Generated Content in Web 3.0
Authors:
Jinbo Wen,
Jiawen Kang,
Linfeng Zhang,
Xiaoying Tang,
Jianhang Tang,
Yang Zhang,
Zhaohui Yang,
Dusit Niyato
Abstract:
Web 3.0 represents the next generation of the Internet, which is widely recognized as a decentralized ecosystem that focuses on value expression and data ownership. By leveraging blockchain and artificial intelligence technologies, Web 3.0 offers unprecedented opportunities for users to create, own, and monetize their content, thereby elevating User-Generated Content (UGC) to an entirely new level. However, some self-interested users may exploit the limitations of content curation mechanisms and generate low-quality content with less effort, obtaining platform rewards under information asymmetry. Such behavior can undermine Web 3.0 performance. To this end, we propose \textit{LMM-Incentive}, a novel Large Multimodal Model (LMM)-based incentive mechanism for UGC in Web 3.0. Specifically, we propose an LMM-based contract-theoretic model to motivate users to generate high-quality UGC, thereby mitigating the adverse selection problem from information asymmetry. To alleviate potential moral hazards after contract selection, we leverage LMM agents to evaluate UGC quality, which is the primary component of the contract, utilizing prompt engineering techniques to improve the evaluation performance of LMM agents. Recognizing that traditional contract design methods cannot effectively adapt to the dynamic environment of Web 3.0, we develop an improved Mixture of Experts (MoE)-based Proximal Policy Optimization (PPO) algorithm for optimal contract design. Simulation results demonstrate the superiority of the proposed MoE-based PPO algorithm over representative benchmarks in the context of contract design. Finally, we deploy the designed contract within an Ethereum smart contract framework, further validating the effectiveness of the proposed scheme.
Submitted 6 October, 2025;
originally announced October 2025.
-
Forecasting-Based Biomedical Time-series Data Synthesis for Open Data and Robust AI
Authors:
Youngjoon Lee,
Seongmin Cho,
Yehhyun Jo,
Jinu Gong,
Hyunjoo Jenny Lee,
Joonhyuk Kang
Abstract:
The limited data availability due to strict privacy regulations and significant resource demands severely constrains biomedical time-series AI development, which creates a critical gap between data requirements and accessibility. Synthetic data generation presents a promising solution by producing artificial datasets that maintain the statistical properties of real biomedical time-series data without compromising patient confidentiality. We propose a framework for synthetic biomedical time-series data generation based on advanced forecasting models that accurately replicates complex electrophysiological signals such as EEG and EMG with high fidelity. These synthetic datasets preserve essential temporal and spectral properties of real data, which enables robust analysis while effectively addressing data scarcity and privacy challenges. Our evaluations across multiple subjects demonstrate that the generated synthetic data can serve as an effective substitute for real data and also significantly boost AI model performance. The approach maintains critical biomedical features while providing high scalability for various applications and integrates seamlessly into open-source repositories, substantially expanding resources for AI-driven biomedical research.
Submitted 6 October, 2025;
originally announced October 2025.
-
Relief of EGFR/FOS-downregulated miR-103a by loganin alleviates NF-kappaB-triggered inflammation and gut barrier disruption in colitis
Authors:
Yan Li,
Teng Hui,
Xinhui Zhang,
Zihan Cao,
Ping Wang,
Shirong Chen,
Ke Zhao,
Yiran Liu,
Yue Yuan,
Dou Niu,
Xiaobo Yu,
Gan Wang,
Changli Wang,
Yan Lin,
Fan Zhang,
Hefang Wu,
Guodong Feng,
Yan Liu,
Jiefang Kang,
Yaping Yan,
Hai Zhang,
Xiaochang Xue,
Xun Jiang
Abstract:
Due to the ever-rising global incidence rate of inflammatory bowel disease (IBD) and the lack of effective clinical treatment drugs, elucidating the detailed pathogenesis, seeking novel targets, and developing promising drugs are top priorities for IBD treatment. Here, we demonstrate that the levels of microRNA (miR)-103a were significantly downregulated in the inflamed mucosa of ulcerative colitis (UC) patients, along with elevated inflammatory cytokines (IL-1beta/TNF-alpha) and reduced tight junction protein (Occludin/ZO-1) levels, as compared with healthy control subjects. Consistently, miR-103a-deficient Caco-2 intestinal epithelial cells showed serious inflammatory responses and increased permeability, and DSS induced more severe colitis in miR-103a-/- mice than in wild-type mice. Mechanistic studies revealed that c-FOS suppressed miR-103a transcription via binding to its promoter, and that miR-103a-targeted NF-kappaB activation then contributes to inflammatory responses and barrier disruption by targeting TAB2 and TAK1. Notably, the traditional Chinese medicine Cornus officinalis (CO) and its core active ingredient loganin potently mitigated inflammation and barrier disruption in UC by specifically blocking the EGFR/RAS/ERK/c-FOS signaling axis. These effects were mainly attributed to modulated miR-103a levels, as their therapeutic activities were almost completely abolished in miR-103a KO mice. Taken together, this work reveals that loganin relieves EGFR/c-FOS axis-suppressed epithelial miR-103a expression, thereby inhibiting NF-kappaB pathway activation, suppressing inflammatory responses, and preserving tight junction integrity in UC. Thus, our data enrich mechanistic insights and promising targets for UC treatment.
Submitted 5 October, 2025;
originally announced October 2025.
-
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack
Authors:
Yein Park,
Jungwoo Park,
Jaewoo Kang
Abstract:
Large language models (LLMs), despite being safety-aligned, exhibit brittle refusal behaviors that can be circumvented by simple linguistic changes. Tense jailbreaking, in which models that refuse harmful requests often comply when the requests are rephrased in the past tense, reveals a critical generalization gap in current alignment methods, whose underlying mechanisms are poorly understood. In this work, we introduce Activation-Scaling Guard (ASGuard), an insightful, mechanistically-informed framework that surgically mitigates this specific vulnerability. First, we use circuit analysis to identify the specific attention heads causally linked to the targeted jailbreaking, the tense-changing attack. Second, we train a precise, channel-wise scaling vector to recalibrate the activation of tense-vulnerable heads. Lastly, we apply it in a "preventative fine-tuning" step, forcing the model to learn a more robust refusal mechanism. Across three LLMs, ASGuard effectively reduces the attack success rate of targeted jailbreaking while preserving general capabilities and minimizing over-refusal, achieving a Pareto-optimal balance between safety and utility. Our findings underscore how adversarial suffixes suppress the propagation of the refusal-mediating direction, based on mechanistic analysis. Furthermore, our work showcases how a deep understanding of model internals can be leveraged to develop practical, efficient, and targeted methods for adjusting model behavior, charting a course for more reliable and interpretable AI safety.
Submitted 30 September, 2025;
originally announced September 2025.
-
Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training
Authors:
Yein Park,
Minbyul Jeong,
Jaewoo Kang
Abstract:
The remarkable capabilities of modern large reasoning models are largely unlocked through post-training techniques such as supervised fine-tuning and reinforcement learning. However, the architectural mechanisms behind such improvements remain largely opaque. In this work, we use circuit analysis to demonstrate that post-training for complex reasoning sparks the emergence of novel, functionally specialized attention heads. These heads collectively support structured reasoning and computation. Our comparative analysis across Qwen families and a DeepSeek-distilled model reveals that these emergent heads evolve differently under different training regimes. Distillation and SFT foster a cumulative addition of stable reasoning heads. In contrast, group relative policy optimization operates in a dynamic search mode: relatively few attention heads are iteratively activated, evaluated, and pruned, with their survival closely tracking fluctuations in the task reward signal. Furthermore, we find that controllable think on/off models do not possess dedicated thinking heads. Instead, turning off explicit reasoning triggers a broader, but less efficient, set of compensatory heads. Through ablation and qualitative analyses, we connect these circuit-level dynamics to a crucial performance trade-off: strengthened heads enable sophisticated problem-solving strategies for difficult problems but can also introduce over-thinking failure modes, such as calculation errors or logical loops on simpler tasks. These findings connect circuit-level dynamics to macro-level performance, identifying an inherent tension where complex reasoning comes at the cost of elementary computations. More broadly, our work points to future directions for training policy design, emphasizing the need to balance the development of effective reasoning strategies with the assurance of reliable, flawless execution.
Submitted 30 September, 2025;
originally announced September 2025.
-
Enhancing Linear Attention with Residual Learning
Authors:
Xunhao Lai,
Jialiang Kang,
Jianqiao Lu,
Tong Lin,
Pengyu Zhao
Abstract:
Linear attention offers a linear-time alternative to self-attention but often struggles to capture long-range patterns. We revisit linear attention through a prediction-correction lens and show that prevalent variants can be written as a combination of a historical prediction and a single-token correction, which creates an expressivity bottleneck. To address this bottleneck, we introduce Residual Linear Attention (RLA), a framework that equips linear attention with an explicit residual-fitting mechanism. RLA maintains an auxiliary recurrent state that learns to accumulate residual errors over time and correct the base prediction. We further instantiate a delta-rule version, Residual Delta Net (RDN), incorporating adaptive gating and residual clipping for enhanced correction control and stability. Our implementation leverages highly optimized linear attention kernels and preserves linear time and memory. Across language modeling and recall-intensive evaluations, RLA and RDN consistently outperform their respective baselines and other modern linear-attention methods, narrowing the gap to standard Transformers while retaining linear scaling.
Submitted 24 September, 2025;
originally announced September 2025.
-
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
Authors:
Siheng Wang,
Zhengdao Li,
Yanshu Li,
Canran Xiao,
Haibo Zhan,
Zhengtao Yao,
Xuzhi Zhang,
Jiale Kang,
Linshan Li,
Weiming Liu,
Zhikang Dong,
Jifeng Shen,
Junhao Dong,
Qiang Sun,
Piotr Koniusz
Abstract:
Object detection has advanced significantly in the closed-set setting, but real-world deployment remains limited by two challenges: poor generalization to unseen categories and insufficient robustness under adverse conditions. Prior research has explored these issues separately: visible-infrared detection improves robustness but lacks generalization, while open-world detection leverages a vision-language alignment strategy for category diversity but struggles under extreme environments. This trade-off leaves robustness and diversity difficult to achieve simultaneously. To mitigate these issues, we propose \textbf{C3-OWD}, a curriculum cross-modal contrastive learning framework that unifies both strengths. Stage~1 enhances robustness by pretraining with RGBT data, while Stage~2 improves generalization via vision-language alignment. To prevent catastrophic forgetting between the two stages, we introduce an Exponential Moving Average (EMA) mechanism that theoretically guarantees preservation of pre-stage performance with bounded parameter lag and function consistency. Experiments on FLIR, OV-COCO, and OV-LVIS demonstrate the effectiveness of our approach: C3-OWD achieves $80.1$ AP$^{50}$ on FLIR, $48.6$ AP$^{50}_{\text{Novel}}$ on OV-COCO, and $35.7$ mAP$_r$ on OV-LVIS, establishing competitive performance across both robustness and diversity evaluations. Code available at: https://github.com/justin-herry/C3-OWD.git.
Submitted 27 September, 2025;
originally announced September 2025.
-
BICEP/Keck XX: Component-separated maps of polarized CMB and thermal dust emission using Planck and BICEP/Keck Observations through the 2018 Observing Season
Authors:
BICEP/Keck Collaboration,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
C. A. Bischoff,
D. Beck,
J. J. Bock,
H. Boenish,
V. Buza,
B. Cantrall,
J. R. Cheshire IV,
J. Connors,
J. Cornelison,
M. Crumrine,
A. J. Cukierman,
E. Denison,
L. Duband,
M. Echter,
M. Eiben,
B. D. Elwood,
S. Fatigoni,
J. P. Filippini
, et al. (73 additional authors not shown)
Abstract:
We present component-separated polarization maps of the cosmic microwave background (CMB) and Galactic thermal dust emission, derived using data from the BICEP/Keck experiments through the 2018 observing season and Planck. By employing a maximum-likelihood method that utilizes observing matrices, we produce unbiased maps of the CMB and dust signals. We outline the computational challenges and demonstrate an efficient implementation of the component map estimator. We show methods to compute and characterize power spectra of these maps, opening up an alternative way to infer the tensor-to-scalar ratio from our data. We compare the results of this map-based separation method with the baseline BICEP/Keck analysis. Our analysis demonstrates consistency between the two methods, finding an 84% correlation between the pipelines.
Submitted 25 September, 2025;
originally announced September 2025.
-
Orbital magnetization and magnetic susceptibility of interacting electrons
Authors:
Jian Kang,
Minxuan Wang,
Oskar Vafek
Abstract:
We present a rigorous derivation of the orbital magnetization formula for interacting electrons within the self-consistent Hartree-Fock approximation. Our results are expressed entirely in terms of the self-consistent wavefunctions and the Hartree-Fock energy spectrum at zero magnetic field. We test the formula on an interacting Rashba model, finding agreement with calculations performed at small but non-zero external magnetic field. Our method also allows us to derive formulas for the orbital magnetic susceptibility.
Submitted 24 September, 2025;
originally announced September 2025.
-
Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction
Authors:
Huanxin Sheng,
Xinyi Liu,
Hangfeng He,
Jieyu Zhao,
Jian Kang
Abstract:
LLM-as-a-judge has become a promising paradigm for using large language models (LLMs) to evaluate natural language generation (NLG), but the uncertainty of its evaluation remains underexplored. This lack of reliability may limit its deployment in many applications. This work presents the first framework to analyze the uncertainty by offering a prediction interval of LLM-based scoring via conformal prediction. Conformal prediction constructs continuous prediction intervals from a single evaluation run, and we design an ordinal boundary adjustment for discrete rating tasks. We also suggest a midpoint-based score within the interval as a low-bias alternative to the raw model score and the weighted average. We perform extensive experiments and analysis, which show that conformal prediction can provide valid prediction intervals with coverage guarantees. We also explore the usefulness of the interval midpoint and judge reprompting for better judgment.
Submitted 23 September, 2025;
originally announced September 2025.
-
Self-Evolving LLMs via Continual Instruction Tuning
Authors:
Jiazheng Kang,
Le Huang,
Cheng Hou,
Zhe Zhao,
Zhenxiang Yan,
Ting Bai
Abstract:
In real-world industrial settings, large language models (LLMs) must learn continually to keep pace with diverse and evolving tasks, requiring self-evolution to refine knowledge under dynamic data distributions. However, existing continual learning (CL) approaches, such as replay and parameter isolation, often suffer from catastrophic forgetting: training on new tasks degrades performance on earlier ones by overfitting to the new distribution and weakening generalization. We propose MoE-CL, a parameter-efficient adversarial mixture-of-experts framework for industrial-scale, self-evolving continual instruction tuning of LLMs. MoE-CL uses a dual-expert design: (1) a dedicated LoRA expert per task to preserve task-specific knowledge via parameter independence, mitigating forgetting; and (2) a shared LoRA expert to enable cross-task transfer. To prevent transferring task-irrelevant noise through the shared pathway, we integrate a task-aware discriminator within a GAN. The discriminator encourages the shared expert to pass only task-aligned information during sequential training. Through adversarial learning, the shared expert acquires generalized representations that mimic the discriminator, while dedicated experts retain task-specific details, balancing knowledge retention and cross-task generalization and thereby supporting self-evolution. Extensive experiments on the public MTL5 benchmark and an industrial Tencent3 benchmark validate the effectiveness of MoE-CL for continual instruction tuning. In real-world A/B testing for content compliance review on the Tencent Video platform, MoE-CL reduced manual review costs by 15.3%. These results demonstrate that MoE-CL is practical for large-scale industrial deployment where continual adaptation and stable transfer are critical.
Submitted 14 October, 2025; v1 submitted 14 September, 2025;
originally announced September 2025.
-
WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
Authors:
Yuhang Dai,
Ziyu Zhang,
Shuai Wang,
Longhao Li,
Zhao Guo,
Tianlun Zuo,
Shuiyuan Wang,
Hongfei Xue,
Chengyou Wang,
Qing Wang,
Xin Xu,
Hui Bu,
Jie Li,
Jian Kang,
Binbin Zhang,
Lei Xie
Abstract:
The scarcity of large-scale, open-source data for dialects severely hinders progress in speech technology, a challenge particularly acute for the widely spoken Sichuanese dialects of Chinese. To address this critical gap, we introduce WenetSpeech-Chuan, a 10,000-hour, richly annotated corpus constructed using our novel Chuan-Pipeline, a complete data processing framework for dialectal speech. To facilitate rigorous evaluation and demonstrate the corpus's effectiveness, we also release high-quality ASR and TTS benchmarks, WenetSpeech-Chuan-Eval, with manually verified transcriptions. Experiments show that models trained on WenetSpeech-Chuan achieve state-of-the-art performance among open-source systems and demonstrate results comparable to commercial services. As the largest open-source corpus for Sichuanese dialects, WenetSpeech-Chuan not only lowers the barrier to research in dialectal speech processing but also plays a crucial role in promoting AI equity and mitigating bias in speech technologies. The corpus, benchmarks, models, and receipts are publicly available on our project page.
Submitted 22 September, 2025;
originally announced September 2025.
-
Doubly Robust Estimation of Continuous Outcomes under Multiple Treatment Levels via GPS, CBPS, and Penalized Empirical Likelihood
Authors:
Byeonghee Lee,
Joonsung Kang
Abstract:
This paper develops a unified framework for estimating continuous outcomes under multiple treatment levels in observational studies. We integrate the Generalized Propensity Score (GPS), Covariate Balancing Propensity Score (CBPS), and outcome regression into a Penalized Empirical Likelihood (PEL) formulation. The GPS is parameterized by $\boldsymbolβ$ and denoted $π_{\boldsymbolβ}(\mathbf{X})$, while CBPS imposes moment conditions to ensure covariate balance. Outcome regression flexibly models the continuous response $Y$, and doubly robust estimation ensures consistency under either correct model specification. PEL allows simultaneous estimation and variable selection using general estimating equations. Simulation results and comparisons with state-of-the-art meta-learners confirm the effectiveness of our method.
Submitted 19 September, 2025;
originally announced September 2025.
-
Time-inconsistent reinsurance and investment optimization problem with delay under random risk aversion
Authors:
Jian-hao Kang,
Zhun Gou,
Nan-jing Huang
Abstract:
This paper considers a new delayed reinsurance and investment optimization problem incorporating random risk aversion, in which an insurer pursues maximization of the expected certainty equivalent of their terminal wealth and the cumulative delayed information of the wealth over a period. Specifically, the insurer's surplus dynamics are approximated using a drifted Brownian motion, while the financial market is described by the Black-Scholes model. Moreover, the performance-linked capital flow feature is incorporated and the wealth process is formulated via a stochastic delay differential equation (SDDE). By adopting a game-theoretic approach, a verification theorem with rigorous proofs is established to capture the equilibrium reinsurance and investment strategy along with the equilibrium value function. Furthermore, for the cases of exponential utility and power utility, analytical or semi-analytical equilibrium reinsurance and investment strategies together with their equilibrium value functions are obtained under mild conditions. Finally, several numerical experiments are conducted to analyze the behavioral characteristics of the derived equilibrium reinsurance and investment strategy.
Submitted 18 September, 2025;
originally announced September 2025.
-
ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding
Authors:
Jialiang Kang,
Han Shu,
Wenshuo Li,
Yingjie Zhai,
Xinghao Chen
Abstract:
Speculative decoding is a widely adopted technique for accelerating inference in large language models (LLMs), yet its application to vision-language models (VLMs) remains underexplored, with existing methods achieving only modest speedups (<1.5x). This gap is increasingly significant as multimodal capabilities become central to large-scale models. We hypothesize that large VLMs can effectively filter redundant image information layer by layer without compromising textual comprehension, whereas smaller draft models struggle to do so. To address this, we introduce Vision-Aware Speculative Decoding (ViSpec), a novel framework tailored for VLMs. ViSpec employs a lightweight vision adaptor module to compress image tokens into a compact representation, which is seamlessly integrated into the draft model's attention mechanism while preserving original image positional information. Additionally, we extract a global feature vector for each input image and augment all subsequent text tokens with this feature to enhance multimodal coherence. To overcome the scarcity of multimodal datasets with long assistant responses, we curate a specialized training dataset by repurposing existing datasets and generating extended outputs using the target VLM with modified prompts. Our training strategy mitigates the risk of the draft model exploiting direct access to the target model's hidden states, which could otherwise lead to shortcut learning when training solely on target model outputs. Extensive experiments validate ViSpec, achieving, to our knowledge, the first substantial speedup in VLM speculative decoding. Code is available at https://github.com/KangJialiang/ViSpec.
Submitted 23 October, 2025; v1 submitted 17 September, 2025;
originally announced September 2025.
-
Automated Triaging and Transfer Learning of Incident Learning Safety Reports Using Large Language Representational Models
Authors:
Peter Beidler,
Mark Nguyen,
Kevin Lybarger,
Ola Holmberg,
Eric Ford,
John Kang
Abstract:
PURPOSE: Incident reports are an important tool for safety and quality improvement in healthcare, but manual review is time-consuming and requires subject matter expertise. Here we present a natural language processing (NLP) screening tool to detect high-severity incident reports in radiation oncology across two institutions.
METHODS AND MATERIALS: We used two text datasets to train and evaluate our NLP models: 7,094 reports from our institution (Inst.) and 571 from IAEA SAFRON (SF), all of which had severity scores labeled by clinical content experts. We trained and evaluated two types of models: baseline support vector machines (SVM) and BlueBERT, a large language model pretrained on PubMed abstracts and hospitalized patient data. We assessed the generalizability of our model in two ways. First, we evaluated models trained using Inst.-train on SF-test. Second, we trained a BlueBERT_TRANSFER model that was first fine-tuned on Inst.-train and then on SF-train before testing on the SF-test set. To further analyze model performance, we also examined a subset of 59 reports from our Inst. dataset, which were manually edited for clarity.
RESULTS: Classification performance on the Inst. test set achieved AUROC 0.82 using SVM and 0.81 using BlueBERT. Without cross-institution transfer learning, performance on the SF test set was limited to an AUROC of 0.42 using SVM and 0.56 using BlueBERT. BlueBERT_TRANSFER, which was fine-tuned on both datasets, improved the performance on the SF test set to AUROC 0.78. Performance of the SVM and BlueBERT_TRANSFER models on the manually curated Inst. reports (AUROC 0.85 and 0.74, respectively) was similar to human performance (AUROC 0.81).
CONCLUSION: In summary, we successfully developed cross-institution NLP models on incident report text from radiation oncology centers. These models were able to detect high-severity reports similarly to humans on a curated dataset.
Submitted 17 September, 2025;
originally announced September 2025.
-
Transverse single-spin asymmetry of forward $η$ mesons in $p^{\uparrow}+ p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
D. Anderson,
S. Antsupov,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
B. Bannier,
E. Bannikov,
K. N. Barish,
S. Bathe,
V. Baublis,
C. Baumann
, et al. (359 additional authors not shown)
Abstract:
Utilizing the 2012 transversely polarized proton data from the Relativistic Heavy Ion Collider at Brookhaven National Laboratory, the forward $η$-meson transverse single-spin asymmetry ($A_N$) was measured for $p^{\uparrow}+p$ collisions at $\sqrt{s}=200$ GeV as a function of Feynman-x ($x_F$) for $0.2<|x_F|<0.8$ and transverse momentum ($p_T$) for $1.0<p_T<5.0$ GeV/$c$. Large asymmetries at positive $x_F$ are observed ($\left<A_N\right>=0.086 \pm 0.019$), agreeing well with previous measurements of $π^{0}$ and $η$ $A_N$, but with reach to higher $x_F$ and $p_T$. The contribution of initial-state spin-momentum correlations to the asymmetry, as calculated in the collinear twist-3 framework, appears insufficient to describe the data and suggests a significant impact on the asymmetry from fragmentation.
Submitted 16 September, 2025;
originally announced September 2025.
-
K-Theory and Structural Properties of $C^*$-Algebras Associated with Relative Generalized Boolean Dynamical Systems
Authors:
Toke Meier Carlsen,
Eun Ji Kang
Abstract:
We present an explicit formula for the $K$-theory of the $C^*$-algebra associated with a relative generalized Boolean dynamical system $(\CB, \CL, θ, \CI_\af; \CJ)$. In particular, we find concrete generators for the $K_1$-group of $C^*(\CB, \CL, θ, \CI_\af; \CJ)$. We also prove that every gauge-invariant ideal of $C^*(\CB, \CL, θ, \CI_\af; \CJ)$ is Morita equivalent to a $C^*$-algebra of a relative generalized Boolean dynamical system.
As a structural application, we show that if the underlying Boolean dynamical system $(\CB, \CL, θ)$ satisfies Condition (K), then the associated $C^*$-algebra is $K_0$-liftable. Furthermore, we deduce that if $C^*(\CB, \CL, θ, \CI_\af; \CJ)$ is separable and purely infinite, then it has real rank zero.
Submitted 16 September, 2025;
originally announced September 2025.
-
Outlier-Resistant Heterogeneous Treatment Effect Estimation in HDLSS Settings via GAT--CVAE Framework
Authors:
Byeonghee Lee,
Joonsung Kang
Abstract:
We introduce a robust framework for heterogeneous treatment effect (HTE) estimation tailored to high-dimensional low sample size (HDLSS) settings. By combining Graph Attention Networks (GAT) to capture structural dependencies among confounders with a Conditional Variational Autoencoder (CVAE) for latent representation learning, our method expands the sample space and performs clustering that integrates even outlier sets into coherent subgroups. Clusterwise causal effects are then estimated using a doubly robust outlier-resistant estimator, yielding stable and generalizable results. Simulations and real-world applications confirm superior performance compared with existing HTE methods, highlighting the framework's potential for precision medicine and policy evaluation.
Submitted 12 September, 2025;
originally announced September 2025.
-
An Interpretable Ensemble Framework for Multi-Omics Dementia Biomarker Discovery Under HDLSS Conditions
Authors:
Byeonghee Lee,
Joonsung Kang
Abstract:
Biomarker discovery in neurodegenerative diseases requires robust, interpretable frameworks capable of integrating high-dimensional multi-omics data under low-sample conditions. We propose a novel ensemble approach combining Graph Attention Networks (GAT), MultiOmics Variational AutoEncoder (MOVE), Elastic-net sparse regression, and Storey's False Discovery Rate (FDR). This framework is benchmarked against state-of-the-art methods including DIABLO, MOCAT, AMOGEL, and MOMLIN. We evaluate performance using both simulated multi-omics data and the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. Our method demonstrates superior predictive accuracy, feature selection precision, and biological relevance. Biomarker gene maps derived from both datasets are visualized and interpreted, offering insights into latent molecular mechanisms underlying dementia.
Submitted 4 September, 2025;
originally announced September 2025.
-
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Authors:
Jenna Kang,
Maria Silva,
Patsorn Sangkloy,
Kenneth Chen,
Niall Williams,
Qi Sun
Abstract:
Recent advances in probabilistic generative models have extended capabilities from static image synthesis to text-driven video generation. However, the inherent randomness of their generation process can lead to unpredictable artifacts, such as impossible physics and temporal inconsistency. Progress in addressing these challenges requires systematic benchmarks, yet existing datasets primarily focus on generative images due to the unique spatio-temporal complexities of videos. To bridge this gap, we introduce GeneVA, a large-scale artifact dataset with rich human annotations that focuses on spatio-temporal artifacts in videos generated from natural text prompts. We hope GeneVA can enable and assist critical applications, such as benchmarking model performance and improving generative video quality.
Submitted 10 September, 2025;
originally announced September 2025.
-
Doubly robust average treatment effect estimation for survival data
Authors:
Byeonghee Lee,
Joonsung Kang
Abstract:
Accounting for censored outcomes in survival analysis can considerably complicate the model setting of causal inference. Causal inference has attracted a lot of attention over the past few years, but little research has addressed survival analysis. The only existing study relies on a machine learning method that assumes a large sample, which is unsuitable when the actual data are high-dimensional with low sample size (HDLSS). We therefore penalize the numerous covariates and capture the relationship between these covariates and the treatment variable through a covariate balancing propensity score (CBPS), while also accounting for censored outcomes. To this end, we use penalized empirical likelihood, which considers both the estimating equation and the penalty. The proposed average treatment effect (ATE) estimator possesses the oracle property, exhibiting key characteristics such as double robustness for unbiasedness, sparsity in model selection, and asymptotic normality.
Submitted 10 September, 2025;
originally announced September 2025.
-
Joint Optimization of Computation Offloading and Resource Allocation in ISAC-assisted SAGIN-based IoT
Authors:
Sooyeob Jung,
Seongah Jeong,
Jinkyu Kang
Abstract:
In this letter, an energy-efficient integrated sensing and communication (ISAC) scheme for space-air-ground integrated network (SAGIN)-based Internet of Things (IoT) systems is proposed to facilitate wide coverage and real-time 6G services. To process the sizable data collected at an IoT device, a hybrid edge computing scheme is applied with cloudlets mounted on an autonomous aerial vehicle (AAV) and a low earth orbit (LEO) satellite, where the AAV with multiple antennas performs uplink sensing of a nearby target. With the aim of minimizing the AAV's total energy consumption, we optimize the durations of the training and data phases and the bit allocation coupled with the offloading ratio under the constraints for offloading and sensing. Simulations verify that the superiority of the proposed algorithm is pronounced given sufficient mission time and a high sensing performance constraint.
Submitted 9 September, 2025;
originally announced September 2025.
-
Physical origin of current-induced switching angle shift in magnetic heterostructures
Authors:
Xiaomiao Yin,
Guanglei Han,
Guowen Gong,
Jun Kang,
Changmin Xiong,
Lijun Zhu
Abstract:
Accurate quantification of the spin-orbit torques (SOTs) is critical for the identification and application of new spin-orbitronic effects. One of the most popular techniques to quantify the SOTs is the switching angle shift, in which the applied direct current is assumed to shift, via domain wall depinning during the anti-domain expansion, the switching angle of a perpendicular magnetization in linear proportion to the current under a large rotating magnetic field. Here, we report that, for the most commonly employed perpendicular magnetization heterostructures in spintronics (e.g., those based on FeCoB, Co, and Co/Ni multilayers), the switching angle shift considerably misestimates the SOT within the domain wall depinning analysis of the slope of the linear-in-current scaling and may also have a non-zero residual value at zero direct current. Our experiments and simulations unveil that the switching angle shift is most likely dominated by the chiral asymmetric nucleation rather than the expansion of the anti-domains. The in-plane field from the external magnet and the current-induced SOTs lower the perpendicular nucleation field and thus the required switching angle, ultimately leading to underestimation of the SOTs by the domain wall depinning analysis. These results advance the understanding of magnetization switching in spintronic devices.
Submitted 9 September, 2025;
originally announced September 2025.
-
WenetSpeech-Yue: A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
Authors:
Longhao Li,
Zhao Guo,
Hongjie Chen,
Yuhang Dai,
Ziyu Zhang,
Hongfei Xue,
Tianlun Zuo,
Chengyou Wang,
Shuiyuan Wang,
Jie Li,
Jian Kang,
Xin Xu,
Hui Bu,
Binbin Zhang,
Ruibin Yuan,
Ziya Zhou,
Wei Xue,
Lei Xie
Abstract:
The development of speech understanding and generation has been significantly accelerated by the availability of large-scale, high-quality speech datasets. Among these, ASR and TTS are regarded as the most established and fundamental tasks. However, for Cantonese (Yue Chinese), spoken by approximately 84.9 million native speakers worldwide, limited annotated resources have hindered progress and resulted in suboptimal ASR and TTS performance. To address this challenge, we propose WenetSpeech-Pipe, an integrated pipeline for building large-scale speech corpora with multi-dimensional annotation tailored for speech understanding and generation. It comprises six modules: Audio Collection, Speaker Attributes Annotation, Speech Quality Annotation, Automatic Speech Recognition, Text Postprocessing, and Recognizer Output Voting, enabling rich and high-quality annotations. Based on this pipeline, we release WenetSpeech-Yue, the first large-scale Cantonese speech corpus with multi-dimensional annotation for ASR and TTS, covering 21,800 hours across 10 domains with annotations including ASR transcription, text confidence, speaker identity, age, gender, and speech quality scores, among others. We also release WSYue-eval, a comprehensive Cantonese benchmark with two components: WSYue-ASR-eval, a manually annotated set for evaluating ASR on short and long utterances, code-switching, and diverse acoustic conditions, and WSYue-TTS-eval, with base and coverage subsets for standard and generalization testing. Experimental results show that models trained on WenetSpeech-Yue achieve competitive results against state-of-the-art (SOTA) Cantonese ASR and TTS systems, including commercial and LLM-based models, highlighting the value of our dataset and pipeline.
Submitted 5 September, 2025; v1 submitted 4 September, 2025;
originally announced September 2025.
-
Purely GHZ-like entanglement is forbidden in holography
Authors:
Vijay Balasubramanian,
Monica Jinwoo Kang,
Charlie Cummings,
Chitraang Murdia,
Simon F. Ross
Abstract:
We show that three-party entanglement signals in holography obey a relation that is not satisfied by generalized Greenberger-Horne-Zeilinger (GHZ) states. This is the first known inequality on the structure of pure three-party holographic states, and shows that time-symmetric holographic states can never have purely GHZ-like entanglement. We also discuss similar relations for four parties.
Submitted 15 September, 2025; v1 submitted 3 September, 2025;
originally announced September 2025.
-
Dependency Chain Analysis of ROS 2 DDS QoS Policies: From Lifecycle Tutorial to Static Verification
Authors:
Sanghoon Lee,
Junha Kang,
Kyung-Joon Park
Abstract:
Robot Operating System 2 (ROS 2) relies on the Data Distribution Service (DDS), which offers more than 20 Quality of Service (QoS) policies governing availability, reliability, and resource usage. Yet ROS 2 users lack clear guidance on safe policy combinations and validation processes prior to deployment, which often leads to trial-and-error tuning and unexpected runtime failures. To address these challenges, we analyze DDS Publisher-Subscriber communication over a life cycle divided into Discovery, Data Exchange, and Disassociation, and provide a user-oriented tutorial explaining how 16 QoS policies operate in each phase. Building on this analysis, we derive a QoS dependency chain that formalizes inter-policy relationships and classifies 41 dependency violation rules, capturing constraints that commonly cause communication failures in practice. Finally, we introduce QoS Guard, a ROS 2 package that statically validates DDS XML profiles offline, flags conflicts, and enables safe pre-deployment tuning without establishing a live ROS 2 session. Together, these contributions give ROS 2 users both conceptual insight and a concrete tool that enables early detection of misconfigurations, improving the reliability and resource efficiency of ROS 2-based robotic systems.
Submitted 3 September, 2025;
originally announced September 2025.
-
Octupole-driven spin-transfer torque switching of all-antiferromagnetic tunnel junctions
Authors:
Jaimin Kang,
Mohammad Hamdi,
Shun Kong Cheung,
Lin-Ding Yuan,
Mohamed Elekhtiar,
William Rogers,
Andrea Meo,
Peter G. Lim,
M. S. Nicholas Tey,
Anthony D'Addario,
Shiva T. Konakanchi,
Eric Matt,
Jordan Athas,
Sevdenur Arpaci,
Lei Wan,
Sanjay C. Mehta,
Pramey Upadhyaya,
Mario Carpentieri,
Vinayak P. Dravid,
Mark C. Hersam,
Jordan A. Katine,
Gregory D. Fuchs,
Giovanni Finocchio,
Evgeny Y. Tsymbal,
James M. Rondinelli
, et al. (1 additional authors not shown)
Abstract:
Magnetic tunnel junctions (MTJs) based on ferromagnets are canonical devices in spintronics, with wide-ranging applications in data storage, computing, and sensing. They simultaneously exhibit mechanisms for electrical detection of magnetic order through the tunneling magnetoresistance (TMR) effect, and reciprocally, for controlling magnetic order by electric currents through spin-transfer torque (STT). It was long assumed that neither of these effects could be sizeable in tunnel junctions made from antiferromagnetic materials, since they exhibit no net magnetization. Recently, however, it was shown that all-antiferromagnetic tunnel junctions (AFMTJs) based on chiral antiferromagnets do exhibit TMR due to their non-relativistic momentum-dependent spin polarization and cluster magnetic octupole moment, which are manifestations of their spin-split band structure. However, the reciprocal effect, i.e., the antiferromagnetic counterpart of STT driven by currents through the AFMTJ, has been assumed non-existent due to the total electric current being spin-neutral. Here, in contrast to this common expectation, we report nanoscale AFMTJs exhibiting this reciprocal effect, which we term octupole-driven spin-transfer torque (OTT). We demonstrate current-induced OTT switching of PtMn$_3$|MgO|PtMn$_3$ AFMTJs, fabricated on a thermally oxidized silicon substrate, exhibiting a record-high TMR value of 363% at room temperature and switching current densities of the order of 10 MA/cm$^2$. Our theoretical modeling explains the origin of OTT in terms of the imbalance between intra- and inter-sublattice spin currents across the AFMTJ, and equivalently, in terms of the non-zero net cluster octupole polarization of each PtMn$_3$ layer. This work establishes a new materials platform for antiferromagnetic spintronics and provides a pathway towards deeply scaled magnetic memory and room-temperature terahertz technologies.
Submitted 3 September, 2025;
originally announced September 2025.
-
The ALMA-QUARKS survey: Extensive detection of acetamide in multiple high-mass star-forming regions
Authors:
Chunguo Duan,
Xuefang Xu,
Qian Gou,
Tie Liu,
Laurent Pagani,
Fengwei Xu,
Ke Wang,
Xunchuan Liu,
Jun Kang,
Mingwei He,
Jiaxiang Jiao
Abstract:
Acetamide (CH$_{3}$CONH$_{2}$), a key interstellar amide and a methyl derivative of formamide (NH$_{2}$CHO), has been sparsely detected, limiting insights into its prebiotic relevance. We present the first systematic survey for acetamide toward 52 hot molecular cores using ALMA Band 6 data. Acetamide has been detected in 10 cores, markedly expanding the inventory of known emitters. The derived column densities of acetamide range from $(2.5\pm0.9)\times10^{14}$ to $(1.5\pm0.6)\times10^{16}$ cm$^{-2}$, compared to formamide's $(1.1\pm0.1)\times10^{15}$ to $(6.9\pm0.4)\times10^{16}$ cm$^{-2}$. The nearly constant abundance ratios (~3-9) and strong abundance correlation between the two amides across sources suggest a chemically linked formation pathway, likely on grain surfaces. The presence of peptide-like molecules in these regions implies that complex organic species can survive star formation processes, offering a potential pathway toward prebiotic chemistry. These findings constrain the dominant grain surface formation routes of acetamide, confirm its broader prevalence in high-mass star-forming regions, and underscore the importance of targeted amide surveys in tracing the chemical evolution toward prebiotic complexity.
Submitted 17 September, 2025; v1 submitted 1 September, 2025;
originally announced September 2025.
-
Multilingual Speech Recognition Using Discrete Tokens with a Two-step Training Strategy
Authors:
Zehan Li,
Yan Yang,
Xueqing Li,
Jian Kang,
Xiao-Lei Zhang,
Jie Li
Abstract:
Pre-trained models, especially self-supervised learning (SSL) models, have demonstrated impressive results in automatic speech recognition (ASR) tasks. While most applications of SSL models focus on leveraging continuous representations as features for training downstream tasks, the utilization of discrete units has gained increasing attention in recent years owing to its lower storage requirements and broader range of applications. In multilingual ASR tasks, representations at different layers of the model contribute differently to various languages, complicating the unification of discrete unit modeling. In this paper, we propose a two-stage training strategy to improve the discrete token performance of pre-trained models and narrow the gap with continuous representation performance. We validate our method on the XLS-R model following the settings of the Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge. Our method demonstrates a significant improvement on the ML-SUPERB dataset, achieving a 44% relative reduction in CER for the XLS-R model. This surpasses the previous baseline set by the WavLM model, which achieves a 26% relative reduction in CER. Furthermore, our method achieves first place among all the single-system results on the leaderboard.
Submitted 1 September, 2025;
originally announced September 2025.
-
Disentangled Multi-Context Meta-Learning: Unlocking robust and Generalized Task Learning
Authors:
Seonsoo Kim,
Jun-Gill Kang,
Taehong Kim,
Seongil Hong
Abstract:
In meta-learning and its downstream tasks, many methods rely on implicit adaptation to task variations, where multiple factors are mixed together in a single entangled representation. This makes it difficult to interpret which factors drive performance and can hinder generalization. In this work, we introduce a disentangled multi-context meta-learning framework that explicitly assigns each task factor to a distinct context vector. By decoupling these variations, our approach improves robustness through deeper task understanding and enhances generalization by enabling context vector sharing across tasks with shared factors. We evaluate our approach in two domains. First, on a sinusoidal regression task, our model outperforms baselines on out-of-distribution tasks and generalizes to unseen sine functions by sharing context vectors associated with shared amplitudes or phase shifts. Second, in a quadruped robot locomotion task, we disentangle the robot-specific properties and the characteristics of the terrain in the robot dynamics model. By transferring disentangled context vectors acquired from the dynamics model into reinforcement learning, the resulting policy achieves improved robustness under out-of-distribution conditions, surpassing the baselines that rely on a single unified context. Furthermore, by effectively sharing context, our model enables successful sim-to-real policy transfer to challenging terrains with out-of-distribution robot-specific properties, using just 20 seconds of real data from flat terrain, a result not achievable with single-task adaptation.
Submitted 1 September, 2025;
originally announced September 2025.
-
Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation
Authors:
Jialiang Kang,
Jiawen Wang,
Dingsheng Luo
Abstract:
Semantic segmentation of 3D LiDAR data plays a pivotal role in autonomous driving. Traditional approaches rely on extensive annotated data for point cloud analysis, incurring high costs and time investments. In contrast, real-world image datasets offer abundant availability and substantial scale. To mitigate the burden of annotating 3D LiDAR point clouds, we propose two crossmodal knowledge distillation methods: Unsupervised Domain Adaptation Knowledge Distillation (UDAKD) and Feature and Semantic-based Knowledge Distillation (FSKD). Leveraging readily available spatio-temporally synchronized data from cameras and LiDARs in autonomous driving scenarios, we directly apply a pretrained 2D image model to unlabeled 2D data. Through crossmodal knowledge distillation with known 2D-3D correspondence, we actively align the output of the 3D network with the corresponding points of the 2D network, thereby obviating the necessity for 3D annotations. Our focus is on preserving modality-general information while filtering out modality-specific details during crossmodal distillation. To achieve this, we deploy self-calibrated convolution on 3D point clouds as the foundation of our domain adaptation module. Rigorous experimentation validates the effectiveness of our proposed methods, which consistently surpass state-of-the-art approaches in the field.
Submitted 30 August, 2025;
originally announced September 2025.
-
On a class of third order differential equations describing pseudospherical or spherical surfaces
Authors:
Mingyue Guo,
Jing Kang,
Zhenhua Shi,
Zhiwei Wu
Abstract:
In this paper, we study third-order nonlinear partial differential equations which describe surfaces of constant curvature. From the flatness of connection 1-forms, we present a classification of equations of the type $u_t - u_{xxt} = λu^2 u_{xxx} + G(u, u_x, u_{xx}) (λ\in\mathbb{R})$ that describe pseudospherical or spherical surfaces. We show that a series of typical soliton equations, such as the generalized Camassa-Holm equation, belong to a certain subclass, which gives a geometric interpretation of these equations.
Submitted 28 August, 2025;
originally announced August 2025.
-
DIFNet: Decentralized Information Filtering Fusion Neural Network with Unknown Correlation in Sensor Measurement Noises
Authors:
Ruifeng Dong,
Ming Wang,
Ning Liu,
Tong Guo,
Jiayi Kang,
Xiaojing Shen,
Yao Mao
Abstract:
In recent years, decentralized sensor networks have garnered significant attention in the field of state estimation owing to their enhanced robustness, scalability, and fault tolerance. Optimal fusion performance can be achieved under fully connected communication and known noise correlation structures. To mitigate communication overhead, the global state estimation problem is decomposed into local subproblems through a structured observation model. This ensures that even when the communication network is not fully connected, each sensor can achieve locally optimal estimates of its observable state components. To address the degradation of fusion accuracy induced by unknown correlations in measurement noise, this paper proposes a data-driven method, termed Decentralized Information Filter Neural Network (DIFNet), to learn unknown noise correlations from data for discrete-time nonlinear state space models with cross-correlated measurement noises. Numerical simulations demonstrate that DIFNet achieves superior fusion performance compared to conventional filtering methods and remains robust in more complex scenarios, such as the presence of time-varying noise. The source code used in our numerical experiments can be found online at https://wisdom-estimation.github.io/DIFNet_Demonstrate/.
Submitted 26 August, 2025;
originally announced August 2025.