-
Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data
Authors:
Juwei Guan,
Xiaolin Fang,
Donghyun Kim,
Haotian Gong,
Tongxin Zhu,
Zhen Ling,
Ming Yang
Abstract:
Low-quality data often suffer from insufficient image details, introducing an extra implicit aspect of camouflage that complicates camouflaged object detection (COD). Existing COD methods focus primarily on high-quality data, overlooking the challenges posed by low-quality data, which leads to significant performance degradation. Therefore, we propose KRNet, the first framework explicitly designed…
▽ More
Low-quality data often suffer from insufficient image details, introducing an extra implicit aspect of camouflage that complicates camouflaged object detection (COD). Existing COD methods focus primarily on high-quality data, overlooking the challenges posed by low-quality data, which leads to significant performance degradation. Therefore, we propose KRNet, the first framework explicitly designed for COD on low-quality data. KRNet presents a Leader-Follower framework where the Leader extracts dual gold-standard distributions: conditional and hybrid, from high-quality data to drive the Follower in rectifying knowledge learned from low-quality data. The framework further benefits from a cross-consistency strategy that improves the rectification of these distributions and a time-dependent conditional encoder that enriches the distribution diversity. Extensive experiments on benchmark datasets demonstrate that KRNet outperforms state-of-the-art COD methods and super-resolution-assisted COD approaches, proving its effectiveness in tackling the challenges of low-quality data in COD.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Authors:
Chanhyuk Lee,
Jiho Choi,
Chanryeol Lee,
Donggyun Kim,
Seunghoon Hong
Abstract:
Model merging has emerged as a promising approach for unifying independently fine-tuned models into an integrated framework, significantly enhancing computational efficiency in multi-task learning. Recently, several SVD-based techniques have been introduced to exploit low-rank structures for enhanced merging, but their reliance on such manually designed rank selection often leads to cross-task int…
▽ More
Model merging has emerged as a promising approach for unifying independently fine-tuned models into an integrated framework, significantly enhancing computational efficiency in multi-task learning. Recently, several SVD-based techniques have been introduced to exploit low-rank structures for enhanced merging, but their reliance on such manually designed rank selection often leads to cross-task interference and suboptimal performance. In this paper, we propose AdaRank, a novel model merging framework that adaptively selects the most beneficial singular directions of task vectors to merge multiple models. We empirically show that the dominant singular components of task vectors can cause critical interference with other tasks, and that naive truncation across tasks and layers degrades performance. In contrast, AdaRank dynamically prunes the singular components that cause interference and offers an optimal amount of information to each task vector by learning to prune ranks during test-time via entropy minimization. Our analysis demonstrates that such method mitigates detrimental overlaps among tasks, while empirical results show that AdaRank consistently achieves state-of-the-art performance with various backbones and number of tasks, reducing the performance gap between fine-tuned models to nearly 1%.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
Dataset Distillation of 3D Point Clouds via Distribution Matching
Authors:
Jae-Young Yim,
Dongwook Kim,
Jae-Young Sim
Abstract:
Large-scale datasets are usually required to train deep neural networks, but it increases the computational complexity hindering the practical applications. Recently, dataset distillation for images and texts has been attracting a lot of attention, that reduces the original dataset to a synthetic dataset to alleviate the computational burden of training while preserving essential task-relevant inf…
▽ More
Large-scale datasets are usually required to train deep neural networks, but it increases the computational complexity hindering the practical applications. Recently, dataset distillation for images and texts has been attracting a lot of attention, that reduces the original dataset to a synthetic dataset to alleviate the computational burden of training while preserving essential task-relevant information. However, the dataset distillation for 3D point clouds remains largely unexplored, as the point clouds exhibit fundamentally different characteristics from that of images, making the dataset distillation more challenging. In this paper, we propose a distribution matching-based distillation framework for 3D point clouds that jointly optimizes the geometric structures as well as the orientations of the synthetic 3D objects. To address the semantic misalignment caused by unordered indexing of points, we introduce a Semantically Aligned Distribution Matching loss computed on the sorted features in each channel. Moreover, to address the rotation variation, we jointly learn the optimal rotation angles while updating the synthetic dataset to better align with the original feature distribution. Extensive experiments on widely used benchmark datasets demonstrate that the proposed method consistently outperforms existing dataset distillation methods, achieving superior accuracy and strong cross-architecture generalization.
△ Less
Submitted 28 May, 2025; v1 submitted 28 March, 2025;
originally announced March 2025.
-
X-ray Polarization of the High-Synchrotron-Peak BL Lacertae Object 1ES 1959+650 during Intermediate and High X-ray Flux States
Authors:
Luigi Pacciani,
Dawoon E. Kim,
Riccardo Middei,
Herman L. Marshall,
Alan P. Marscher,
Ioannis Liodakis,
Iván Agudo,
Svetlana G. Jorstad,
Juri Poutanen,
Manel Errando,
Laura Di Gesu,
Michela Negro,
Fabrizio Tavecchio,
Kinwah Wu,
Chien-Ting Chen,
Fabio Muleri,
Lucio Angelo Antonelli,
Immacolata Donnarumma,
Steven R. Ehlert,
Francesco Massaro,
Stephen L. O'Dell,
Matteo Perri,
Simonetta Puccetti,
Giacomo Bonnoli,
Pouya M. Kouch
, et al. (75 additional authors not shown)
Abstract:
We report the Imaging X-ray Polarimetry Explorer (IXPE) polarimetric and simultaneous multiwavelength observations of the high-energy-peaked BL Lacertae (HBL) object 1ES 1959+650, performed in 2022 October and 2023 August. In 2022 October IXPE measured an average polarization degree $Π_{\rm X}=9.4\;\!\%\pm 1.6\;\!\%$ and an electric-vector position angle $ψ_{\rm X}=53^{\circ}\pm 5^{\circ}$. The po…
▽ More
We report the Imaging X-ray Polarimetry Explorer (IXPE) polarimetric and simultaneous multiwavelength observations of the high-energy-peaked BL Lacertae (HBL) object 1ES 1959+650, performed in 2022 October and 2023 August. In 2022 October IXPE measured an average polarization degree $Π_{\rm X}=9.4\;\!\%\pm 1.6\;\!\%$ and an electric-vector position angle $ψ_{\rm X}=53^{\circ}\pm 5^{\circ}$. The polarized X-ray emission can be decomposed into a constant component, plus a rotating component, with rotation velocity $ω_{\rm EVPA}=(-117\;\!\pm\;\!12)$ ${\rm deg}\;\!{\rm d}^{-1}$. In 2023 August, during a period of pronounced activity of the source, IXPE measured an average $Π_{\rm X}=12.4\;\!\%\pm0.7\;\!\%$ and $ψ_X=20^{\circ}\pm2^{\circ}$, with evidence ($\sim$0.4$\;\!\%$ chance probability) for a rapidly rotating component with $ω_{\rm EVPA}=(1864\;\!\pm\;\!34)$ ${\rm deg}\;\!{\rm d}^{-1}$. These findings suggest the presence of a helical magnetic field in the jet of 1ES 1959+650 or stochastic processes governing the field in turbulent plasma. Our multiwavelength campaigns from radio to X-ray reveal variability in both polarization and flux from optical to X-rays. We interpret the results in terms of a relatively slowly varying component dominating the radio and optical emission, while rapidly variable polarized components dominate the X-ray and provide minor contribution at optical wavelengths. The radio and optical data indicate that on parsec scales the magnetic field is primarily orthogonal to the jet direction. On the contrary, X-ray measurements show a magnetic field almost aligned with the parsec jet direction. Confronting with other IXPE observations, we guess that the magnetic field of HBLs on sub-pc scale should be rather unstable, often changing its direction with respect to the VLBA jet.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
Possibility of using 3.3$μ$m PAH luminosity as a molecular gas mass estimator
Authors:
Hyunjin Shim,
Junhyun Baek,
Dohyeong Kim,
Minjin Kim,
Hyunmi Song,
Gu Lim,
Jaejun Cho,
Hayeong Jeong,
Yejin Jeong,
Ye-Eun Kang,
Dongseob Lee,
Junyeong Park,
Eunsuk Seo,
Junho Song,
Been Yeo
Abstract:
We present CO(1-0) observations of 50 star-forming galaxies at 0.01<z<0.35, for which 3.3$\,μ$m PAH emission flux or its upper limit is available. A scaling relation between 3.3$\,μ$m PAH luminosity and CO(1-0) luminosity is established covering ~2 orders of magnitude in total IR luminosity and CO luminosity, with a scatter of ~0.23 dex:…
▽ More
We present CO(1-0) observations of 50 star-forming galaxies at 0.01<z<0.35, for which 3.3$\,μ$m PAH emission flux or its upper limit is available. A scaling relation between 3.3$\,μ$m PAH luminosity and CO(1-0) luminosity is established covering ~2 orders of magnitude in total IR luminosity and CO luminosity, with a scatter of ~0.23 dex: $\mathrm{log}\,L_\mathrm{3.3}/\mathrm{L}_\odot=(1.00\pm0.07)\times\mathrm{log}\,L_\mathrm{CO(1-0)}^\prime/(\mathrm{K\,km\,s^{-1}\,pc^2})+(-1.10\pm0.70)$. The slope is near unity, allowing the use of a single value of $\langle\mathrm{log}\,(L_\mathrm{3.3}/L_\mathrm{CO(1-0)}^\prime)\rangle=-1.09\pm0.36~[\mathrm{L}_\odot/(\mathrm{K\,km\,s^{-1}\,pc^2})]$ in the conversion between 3.3$\,μ$m PAH and CO luminosities. The variation in the $L_\mathrm{3.3}/L_\mathrm{CO}^\prime$ ratio is not dependent on the galaxy properties, including total IR luminosity, stellar mass, and SFR excess. The total gas mass, estimated using dust-to-gas ratio and dust mass, is correlated with 3.3$\,μ$m PAH luminosity, in line with the prescription using $α_\mathrm{CO}=0.8-4.5$ covering both normal star-forming galaxies and starburst galaxies. AGN-dominated galaxies tend to have a lower $L_\mathrm{3.3}/L_\mathrm{CO}^\prime$ than non-AGN galaxies, which needs to be investigated further with an increased sample size. The established $L_\mathrm{3.3}$-$L_\mathrm{CO}^\prime$ correlation is expected to be applicable to wide-field near-infrared spectrophotometric surveys that allow the detection of 3.3$\,μ$m emission from numerous low-redshift galaxies.
△ Less
Submitted 27 March, 2025;
originally announced March 2025.
-
ESPPU INPUT: C$^3$ within the "Linear Collider Vision"
Authors:
Matthew B. Andorf,
Mei Bai,
Pushpalatha Bhat,
Valery Borzenets,
Martin Breidenbach,
Sridhara Dasu,
Ankur Dhar,
Tristan du Pree,
Lindsey Gray,
Spencer Gessner,
Ryan Herbst,
Andrew Haase,
Erik Jongewaard,
Dongsung Kim,
Anoop Nagesh Koushik,
Anatoly K. Krasnykh,
Zenghai Li,
Chao Liu,
Jared Maxson,
Julian Merrick,
Sophia L. Morton,
Emilio A. Nanni,
Alireza Nassiri,
Cho-Kuen Ng,
Dimitrios Ntounis
, et al. (12 additional authors not shown)
Abstract:
The Linear Collider Vision calls for a Linear Collider Facility with a physics reach from a Higgs Factory to the TeV-scale with $e^+e^{-}$ collisions. One of the technologies under consideration for the accelerator is a cold-copper distributed-coupling linac capable of achieving high gradient. This technology is being pursued by the C$^3$ collaboration to understand its applicability to future col…
▽ More
The Linear Collider Vision calls for a Linear Collider Facility with a physics reach from a Higgs Factory to the TeV-scale with $e^+e^{-}$ collisions. One of the technologies under consideration for the accelerator is a cold-copper distributed-coupling linac capable of achieving high gradient. This technology is being pursued by the C$^3$ collaboration to understand its applicability to future colliders and broader scientific applications. In this input we share the baseline parameters for a C$^3$ Higgs-factory and the energy reach of up to 3 TeV in the 33 km tunnel foreseen under the Linear Collider Vision. Recent results, near-term plans and future R\&D needs are highlighted.
△ Less
Submitted 6 April, 2025; v1 submitted 26 March, 2025;
originally announced March 2025.
-
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
Authors:
Konyul Park,
Yecheol Kim,
Daehun Kim,
Jun Won Choi
Abstract:
Modern autonomous driving perception systems utilize complementary multi-modal sensors, such as LiDAR and cameras. Although sensor fusion architectures enhance performance in challenging environments, they still suffer significant performance drops under severe sensor failures, such as LiDAR beam reduction, LiDAR drop, limited field of view, camera drop, and occlusion. This limitation stems from i…
▽ More
Modern autonomous driving perception systems utilize complementary multi-modal sensors, such as LiDAR and cameras. Although sensor fusion architectures enhance performance in challenging environments, they still suffer significant performance drops under severe sensor failures, such as LiDAR beam reduction, LiDAR drop, limited field of view, camera drop, and occlusion. This limitation stems from inter-modality dependencies in current sensor fusion frameworks. In this study, we introduce an efficient and robust LiDAR-camera 3D object detector, referred to as MoME, which can achieve robust performance through a mixture of experts approach. Our MoME fully decouples modality dependencies using three parallel expert decoders, which use camera features, LiDAR features, or a combination of both to decode object queries, respectively. We propose Multi-Expert Decoding (MED) framework, where each query is decoded selectively using one of three expert decoders. MoME utilizes an Adaptive Query Router (AQR) to select the most appropriate expert decoder for each query based on the quality of camera and LiDAR features. This ensures that each query is processed by the best-suited expert, resulting in robust performance across diverse sensor failure scenarios. We evaluated the performance of MoME on the nuScenes-R benchmark. Our MoME achieved state-of-the-art performance in extreme weather and sensor failure conditions, significantly outperforming the existing models across various sensor failure scenarios.
△ Less
Submitted 31 March, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Combined Annual Modulation Dark Matter Search with COSINE-100 and ANAIS-112
Authors:
N. Carlin,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. França,
C. Ha,
I. S. Hahn,
S. J. Hollick,
S. B. Hong,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim,
Y. J. Ko,
D. H. Lee
, et al. (49 additional authors not shown)
Abstract:
The annual modulation signal, claimed to be consistent with dark matter as observed by DAMA/LIBRA in a sodium-iodide based detector, has persisted for over two decades. COSINE-100 and ANAIS-112 were designed to test the claim directly using the same target material. COSINE-100, located at Yangyang Underground Laboratory in South Korea, and ANAIS-112, located at Canfranc Underground Laboratory in S…
▽ More
The annual modulation signal, claimed to be consistent with dark matter as observed by DAMA/LIBRA in a sodium-iodide based detector, has persisted for over two decades. COSINE-100 and ANAIS-112 were designed to test the claim directly using the same target material. COSINE-100, located at Yangyang Underground Laboratory in South Korea, and ANAIS-112, located at Canfranc Underground Laboratory in Spain, have been taking data since 2016 and 2017, respectively. Each experiment published its respective results independently. In this paper, we present the results of an annual modulation search as a test of the signal observed by DAMA/LIBRA with the first three respective years of data from COSINE-100 and ANAIS-112. Using a Markov Chain Monte Carlo method, we find best fit values for modulation amplitude of $-0.0002 {\pm} 0.0026$ cpd/kg/keV in the 1-6 keV and $0.0021 {\pm} 0.0028$ cpd/kg/keV in the 2-6 keV energy regions. These results are not compatible with DAMA/LIBRA's assertion for their observation of annual modulation at $3.7σ$ and $2.6σ$, respectively. Performing a simple combination of the newly released 6-years datasets from both experiments find values consistent with no modulation at $0.0005 {\pm} 0.0019$ cpd/kg/keV in the 1-6 keV and $0.0027 {\pm} 0.0021$ cpd/kg/keV in the 2-6 keV energy regions with $4.68σ$ and $3.53σ$ respective exclusions of the DAMA/LIBRA signal.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Systematic Reanalysis of KMTNet Microlensing Events, Paper II: Two New Planets in Giant-Source Events
Authors:
Hongjing Yang,
Jennifer C. Yee,
Jiyuan Zhang,
Chung-Uk Lee,
Dong-Jin Kim,
Ian A. Bond,
Andrzej Udalski,
Kyu-Ha Hwang,
Weicheng Zang,
Qiyue Qian,
Andrew Gould,
Shude Mao,
Michael D. Albrow,
Sun-Ju Chung,
Cheongho Han,
Youn Kil Jung,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Sang-Mok Cha,
Hyoun-Woo Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park
, et al. (39 additional authors not shown)
Abstract:
In this work, we continue to apply the updated KMTNet tender-love care (TLC) photometric pipeline to historical microlensing events. We apply the pipeline to a subsample of events from the KMTNet database, which we refer to as the giant source sample. Leveraging the improved photometric data, we conduct a systematic search for anomalies within this sample. The search successfully uncovers four new…
▽ More
In this work, we continue to apply the updated KMTNet tender-love care (TLC) photometric pipeline to historical microlensing events. We apply the pipeline to a subsample of events from the KMTNet database, which we refer to as the giant source sample. Leveraging the improved photometric data, we conduct a systematic search for anomalies within this sample. The search successfully uncovers four new planet-like anomalies and recovers two previously known planetary signals. After detailed analysis, two of the newly discovered anomalies are confirmed as clear planets: KMT-2019-BLG-0578 and KMT-2021-BLG-0736. Their planet-to-host mass ratios are $q\sim4\times10^{-3}$ and $q\sim1\times10^{-4}$, respectively. Another event, OGLE-2018-BLG-0421 (KMT-2018-BLG-0831), remains ambiguous. Both a stellar companion and a giant planet in the lens system could potentially explain the observed anomaly. The anomaly signal of the last event, MOA-2022-BLG-038 (KMT-2022-BLG-2342), is attributed to an extra source star. Within this sample, our procedure doubles the number of confirmed planets, demonstrating a significant enhancement in the survey sensitivity.
△ Less
Submitted 25 April, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
Authors:
Hyeongjin Nam,
Donghwan Kim,
Jeongtaek Oh,
Kyoung Mu Lee
Abstract:
Most existing methods of 3D clothed human reconstruction from a single image treat the clothed human as a single object without distinguishing between cloth and human body. In this regard, we present DeClotH, which separately reconstructs 3D cloth and human body from a single image. This task remains largely unexplored due to the extreme occlusion between cloth and the human body, making it challe…
▽ More
Most existing methods of 3D clothed human reconstruction from a single image treat the clothed human as a single object without distinguishing between cloth and human body. In this regard, we present DeClotH, which separately reconstructs 3D cloth and human body from a single image. This task remains largely unexplored due to the extreme occlusion between cloth and the human body, making it challenging to infer accurate geometries and textures. Moreover, while recent 3D human reconstruction methods have achieved impressive results using text-to-image diffusion models, directly applying such an approach to this problem often leads to incorrect guidance, particularly in reconstructing 3D cloth. To address these challenges, we propose two core designs in our framework. First, to alleviate the occlusion issue, we leverage 3D template models of cloth and human body as regularizations, which provide strong geometric priors to prevent erroneous reconstruction by the occlusion. Second, we introduce a cloth diffusion model specifically designed to provide contextual information about cloth appearance, thereby enhancing the reconstruction of 3D cloth. Qualitative and quantitative experiments demonstrate that our proposed approach is highly effective in reconstructing both 3D cloth and the human body. More qualitative results are provided at https://hygenie1228.github.io/DeClotH/.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Uniform Diophantine approximation on the Hecke group $\mathbf H_4$
Authors:
Ayreena Bakhtawar,
Dong Han Kim,
Seul Bee Lee
Abstract:
Dirichlet's uniform approximation theorem is a fundamental result in Diophantine approximation that gives an optimal rate of approximation with a given bound. We study uniform Diophantine approximation properties on the Hecke group $\mathbf H_4$. For a given real number $α$, we characterize the sequence of $\mathbf H_4$-best approximations of $α$ and show that they are convergents of the Rosen con…
▽ More
Dirichlet's uniform approximation theorem is a fundamental result in Diophantine approximation that gives an optimal rate of approximation with a given bound. We study uniform Diophantine approximation properties on the Hecke group $\mathbf H_4$. For a given real number $α$, we characterize the sequence of $\mathbf H_4$-best approximations of $α$ and show that they are convergents of the Rosen continued fraction and the dual Rosen continued fraction of $α$. We give analogous theorems of Dirichlet uniform approximation and the Legendre theorem with optimal constants.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
KMTNet View of Blue Large-amplitude Pulsators Toward the Galactic Bulge: I. Discovery of Wide-orbit Companions in OGLE-BLAP-006
Authors:
Seung-Lee Kim,
Chung-Uk Lee,
Kyeongsoo Hong,
Jae Woo Lee,
Dong-Jin Kim,
Sang-Mok Cha,
Yongseok Lee,
Dong-Joo Lee,
Byeong-Gon Park
Abstract:
Blue large-amplitude pulsators (BLAPs), a recently classified type of variable stars, are evolved objects likely formed through interactions between stars in a binary system. However, only two BLAPs with stellar companions have been discovered to date. This paper presents photometric data from the Korea Microlensing Telescope Network (KMTNet) for three BLAPs located in the direction of the Galacti…
▽ More
Blue large-amplitude pulsators (BLAPs), a recently classified type of variable stars, are evolved objects likely formed through interactions between stars in a binary system. However, only two BLAPs with stellar companions have been discovered to date. This paper presents photometric data from the Korea Microlensing Telescope Network (KMTNet) for three BLAPs located in the direction of the Galactic bulge: OGLE-BLAP-006, OGLE-BLAP-007, and OGLE-BLAP-009. The data were collected over eight consecutive years, beginning in 2016, with a high cadence of approximately 15 minutes. Frequency analysis of light variations revealed OGLE-BLAP-006 as a multimode pulsator with a dominant frequency of 37.88 day$^{-1}$ and two new frequencies of 38.25 and 35.05 day$^{-1}$. In contrast, OGLE-BLAP-007 and OGLE-BLAP-009 exhibit single-mode pulsation. By combining the KMTNet data with archival OGLE observations, we investigated pulsation timing variations of the BLAPs using an $O-C$ diagram to identify the light travel time effect caused by the orbital motion of their companions. We found that OGLE-BLAP-006, with no evidence of close companions, has two wide-orbit companions with orbital periods of approximately 4,700 and 6,300 days, making it the third known BLAP in a stellar system; however, no companions were found for OGLE-BLAP-007 and OGLE-BLAP-009. Furthermore, we identified seven other BLAP candidates with wide companions using OGLE data, suggesting that such systems are relatively common. We propose that a BLAP with a wide companion may be a merger remnant of an inner close binary within a hierarchical triple system.
△ Less
Submitted 24 March, 2025;
originally announced March 2025.
-
New constraints on cosmic ray-boosted dark matter from the LUX-ZEPLIN experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araujo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
K. Beattie,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (179 additional authors not shown)
Abstract:
While dual-phase xenon time projection chambers (TPCs) have driven the sensitivity towards weakly interacting massive particles (WIMPs) at the GeV/c^2 to TeV/c^2 mass scale, the scope for sub-GeV/c^2 dark matter particles is hindered by a limited nuclear recoil energy detection threshold. One approach to probe for lighter candidates is to consider cases where they have been boosted by collisions w…
▽ More
While dual-phase xenon time projection chambers (TPCs) have driven the sensitivity towards weakly interacting massive particles (WIMPs) at the GeV/c^2 to TeV/c^2 mass scale, the scope for sub-GeV/c^2 dark matter particles is hindered by a limited nuclear recoil energy detection threshold. One approach to probe for lighter candidates is to consider cases where they have been boosted by collisions with cosmic rays in the Milky Way, such that the additional kinetic energy lifts their induced signatures above the nominal threshold. In this Letter, we report first results of a search for cosmic ray-boosted dark matter (CRDM) with a combined 4.2 tonne-year exposure from the LUX-ZEPLIN (LZ) experiment. We observe no excess above the expected backgrounds and establish world-leading constraints on the spin-independent CRDM-nucleon cross section as small as 3.9 * 10^{-33} cm^2 at 90% confidence level for sub-GeV/c^2 masses.
△ Less
Submitted 2 June, 2025; v1 submitted 23 March, 2025;
originally announced March 2025.
-
Absence of dehydration due to superionic transition at Earth's core-mantle boundary
Authors:
Yu He,
Wei Zhang,
Qingyang Hu,
Shichuan Sun,
Jiaqi Hu,
Daohong Liu,
Li Zhou,
Lidong Dai,
Duck Young Kim,
Yun Liu,
Heping Li,
Ho-kwang Mao
Abstract:
The properties and stability of hydrous phases are key to unraveling the mysteries of the water cycle in Earth's interior. Under the deep lower mantle conditions, hydrous phases transition into a superionic state. However, the influence of the superionic effect on their stability and dehydration processes remains poorly understood. Using ab initio calculations and deep-learning potential molecular…
▽ More
The properties and stability of hydrous phases are key to unraveling the mysteries of the water cycle in Earth's interior. Under the deep lower mantle conditions, hydrous phases transition into a superionic state. However, the influence of the superionic effect on their stability and dehydration processes remains poorly understood. Using ab initio calculations and deep-learning potential molecular dynamics simulations, we discovered a doubly superionic transition in delta-AlOOH, characterized by the highly diffusive behavior of ionic hydrogen and aluminum within the oxygen sub-lattice. These highly diffusive elements contribute significant external entropy into the system, resulting in exceptional thermostability. Free energy calculations indicate that dehydration is energetically and kinetically unfavorable when water exists in a superionic state under core-mantle boundary (CMB) conditions. Consequently, water can accumulate in the deep lower mantle over Earth's history. This deep water reservoir plays a crucial role in the global deep water and hydrogen cycles.
△ Less
Submitted 22 March, 2025;
originally announced March 2025.
-
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
Authors:
Hojun Cho,
Donghu Kim,
Soyoung Yang,
Chan Lee,
Hunjoo Lee,
Jaegul Choo
Abstract:
Language agents powered by large language models (LLMs) face significant deployment challenges in resource-constrained environments, particularly for specialized domains and less-common languages. This paper presents Tox-chat, a Korean chemical toxicity information agent devised within these limitations. We propose two key innovations: a context-efficient architecture that reduces token consumptio…
▽ More
Language agents powered by large language models (LLMs) face significant deployment challenges in resource-constrained environments, particularly for specialized domains and less-common languages. This paper presents Tox-chat, a Korean chemical toxicity information agent devised within these limitations. We propose two key innovations: a context-efficient architecture that reduces token consumption through hierarchical section search, and a scenario-based dialogue generation methodology that effectively distills tool-using capabilities from larger models. Experimental evaluations demonstrate that our fine-tuned 8B parameter model substantially outperforms both untuned models and baseline approaches, in terms of DB faithfulness and preference. Our work offers valuable insights for researchers developing domain-specific language agents under practical constraints.
△ Less
Submitted 22 March, 2025;
originally announced March 2025.
-
Measurements of the branching fractions of $Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}$, $Ξ_{c}^{+}\to Ξ^{0}π^{+}$, and $Ξ_{c}^{+}\to Ξ^{0}K^{+}$ at Belle and Belle II
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
J. K. Ahn,
Y. Ahn,
N. Akopov,
S. Alghamdi,
M. Alhakami,
N. Althubiti,
K. Amos,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
M. Aversano,
R. Ayad,
V. Babu,
N. K. Baghel,
P. Bambade,
Sw. Banerjee,
M. Barrett,
M. Bartl,
J. Baudot,
A. Beaubien,
F. Becherer
, et al. (335 additional authors not shown)
Abstract:
Using 983.0 $\rm{fb}^{-1}$ and 427.9 $\rm{fb}^{-1}$ data samples collected with the Belle and Belle II detectors at the KEKB and SuperKEKB asymmetric energy $e^+e^-$ colliders, respectively, we present studies of the Cabibbo-favored $Ξ_c^+$ decays ${Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}}$ and $Ξ_{c}^{+}\to Ξ^{0}π^{+}$, and the singly Cabibbo-suppressed decay $Ξ_{c}^{+}\to Ξ^{0}K^{+}$. The ratios of branchin…
▽ More
Using 983.0 $\rm{fb}^{-1}$ and 427.9 $\rm{fb}^{-1}$ data samples collected with the Belle and Belle II detectors at the KEKB and SuperKEKB asymmetric energy $e^+e^-$ colliders, respectively, we present studies of the Cabibbo-favored $Ξ_c^+$ decays ${Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}}$ and $Ξ_{c}^{+}\to Ξ^{0}π^{+}$, and the singly Cabibbo-suppressed decay $Ξ_{c}^{+}\to Ξ^{0}K^{+}$. The ratios of branching fractions of ${Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}}$ and $Ξ_{c}^{+}\to Ξ^{0}K^{+}$ relative to that of $Ξ_{c}^{+}\toΞ^{-}π^{+}π^{+}$ are measured for the first time, while the ratio ${\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+})/{\cal B}(Ξ_{c}^{+}\toΞ^{-}π^{+}π^{+}) $ is also determined and improved by an order of magnitude in precision. The measured branching fraction ratios are $\frac{\cal{B}(Ξ_{c}^{+} \to Σ^{+}K_{S}^{0})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)}= 0.067 \pm 0.007 \pm 0.003$, $\frac{\cal{B}(Ξ_c^{+} \to Ξ^{0}π^{+})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)} = 0.248 \pm 0.005 \pm 0.009$, $\frac{\cal{B}(Ξ_c^{+} \to Ξ^{0}K^{+})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)} = 0.017 \pm 0.003 \pm 0.001$. Additionally, the ratio ${\cal B}(Ξ_{c}^{+}\toΞ^{0}K^{+})/{\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+})$ is measured to be $ 0.068 \pm 0.010 \pm 0.004$. Here, the first and second uncertainties are statistical and systematic, respectively. Multiplying the ratios by the branching fraction of the normalization mode, ${\mathcal B}(Ξ_{c}^{+}\toΞ^{-}π^{+}π^+)= (2.9\pm 1.3)\%$, we obtain the following absolute branching fractions ${\cal B}(Ξ_{c}^{+}\toΣ^{+}K^{0}_{S}) = (0.194 \pm 0.021 \pm 0.009 \pm 0.087 )%$, ${\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+}) = (0.719 \pm 0.014 \pm 0.024 \pm 0.322 )%$, ${\cal B}(Ξ_{c}^{+}\toΞ^{0}K^{+}) = (0.049 \pm 0.007 \pm 0.002 \pm 0.022 )%$.
△ Less
Submitted 22 March, 2025;
originally announced March 2025.
-
Predicting Potential Customer Support Needs and Optimizing Search Ranking in a Two-Sided Marketplace
Authors:
Do-kyum Kim,
Han Zhao,
Huiji Gao,
Liwei He,
Malay Haldar,
Sanjeev Katariya
Abstract:
Airbnb is an online marketplace that connects hosts and guests to unique stays and experiences. When guests stay at homes booked on Airbnb, there are a small fraction of stays that lead to support needed from Airbnb's Customer Support (CS), which may cause inconvenience to guests and hosts and require Airbnb resources to resolve. In this work, we show that instances where CS support is needed may…
▽ More
Airbnb is an online marketplace that connects hosts and guests to unique stays and experiences. When guests stay at homes booked on Airbnb, there are a small fraction of stays that lead to support needed from Airbnb's Customer Support (CS), which may cause inconvenience to guests and hosts and require Airbnb resources to resolve. In this work, we show that instances where CS support is needed may be predicted based on hosts and guests behavior. We build a model to predict the likelihood of CS support needs for each match of guest and host. The model score is incorporated into Airbnb's search ranking algorithm as one of the many factors. The change promotes more reliable matches in search results and significantly reduces bookings that require CS support.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Authors:
Dongseob Kim,
Hyunjung Shim
Abstract:
Multi-label classification is crucial for comprehensive image understanding, yet acquiring accurate annotations is challenging and costly. To address this, a recent study suggests exploiting unsupervised multi-label classification leveraging CLIP, a powerful vision-language model. Despite CLIP's proficiency, it suffers from view-dependent predictions and inherent bias, limiting its effectiveness.…
▽ More
Multi-label classification is crucial for comprehensive image understanding, yet acquiring accurate annotations is challenging and costly. To address this, a recent study suggests exploiting unsupervised multi-label classification leveraging CLIP, a powerful vision-language model. Despite CLIP's proficiency, it suffers from view-dependent predictions and inherent bias, limiting its effectiveness. We propose a novel method that addresses these issues by leveraging multiple views near target objects, guided by Class Activation Mapping (CAM) of the classifier, and debiasing pseudo-labels derived from CLIP predictions. Our Classifier-guided CLIP Distillation (CCD) enables selecting multiple local views without extra labels and debiasing predictions to enhance classification performance. Experimental results validate our method's superiority over existing techniques across diverse datasets. The code is available at https://github.com/k0u-id/CCD.
△ Less
Submitted 21 March, 2025;
originally announced March 2025.
-
Advancing Human-Machine Teaming: Concepts, Challenges, and Applications
Authors:
Dian Chen,
Han Jun Yoon,
Zelin Wan,
Nithin Alluru,
Sang Won Lee,
Richard He,
Terrence J. Moore,
Frederica F. Nelson,
Sunghyun Yoon,
Hyuk Lim,
Dan Dongseong Kim,
Jin-Hee Cho
Abstract:
Human-Machine Teaming (HMT) is revolutionizing collaboration across domains such as defense, healthcare, and autonomous systems by integrating AI-driven decision-making, trust calibration, and adaptive teaming. This survey presents a comprehensive taxonomy of HMT, analyzing theoretical models, including reinforcement learning, instance-based learning, and interdependence theory, alongside interdis…
▽ More
Human-Machine Teaming (HMT) is revolutionizing collaboration across domains such as defense, healthcare, and autonomous systems by integrating AI-driven decision-making, trust calibration, and adaptive teaming. This survey presents a comprehensive taxonomy of HMT, analyzing theoretical models, including reinforcement learning, instance-based learning, and interdependence theory, alongside interdisciplinary methodologies. Unlike prior reviews, we examine team cognition, ethical AI, multi-modal interactions, and real-world evaluation frameworks. Key challenges include explainability, role allocation, and scalable benchmarking. We propose future research in cross-domain adaptation, trust-aware AI, and standardized testbeds. By bridging computational and social sciences, this work lays a foundation for resilient, ethical, and scalable HMT systems.
△ Less
Submitted 6 May, 2025; v1 submitted 16 March, 2025;
originally announced March 2025.
-
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Authors:
SeungJu Cha,
Kwanyoung Lee,
Ye-Chan Kim,
Hyunwoo Oh,
Dong-Jin Kim
Abstract:
Recent large-scale text-to-image diffusion models generate photorealistic images but often struggle to accurately depict interactions between humans and objects due to their limited ability to differentiate various interaction words. In this work, we propose VerbDiff to address the challenge of capturing nuanced interactions within text-to-image diffusion models. VerbDiff is a novel text-to-image…
▽ More
Recent large-scale text-to-image diffusion models generate photorealistic images but often struggle to accurately depict interactions between humans and objects due to their limited ability to differentiate various interaction words. In this work, we propose VerbDiff to address the challenge of capturing nuanced interactions within text-to-image diffusion models. VerbDiff is a novel text-to-image generation model that weakens the bias between interaction words and objects, enhancing the understanding of interactions. Specifically, we disentangle various interaction words from frequency-based anchor words and leverage localized interaction regions from generated images to help the model better capture semantics in distinctive words without extra conditions. Our approach enables the model to accurately understand the intended interaction between humans and objects, producing high-quality images with accurate interactions aligned with specified verbs. Extensive experiments on the HICO-DET dataset demonstrate the effectiveness of our method compared to previous approaches.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Analyses of anomalous lensing events detected from the UKIRT microlensing survey
Authors:
Cheongho Han,
Weicheng Zang,
Andrzej Udalski,
Chung-Uk Lee,
Ian A. Bond,
Yongxin Wen,
Bo Ma,
Michael D. Albrow,
Sun-Ju Chung,
Andrew Gould,
Kyu-Ha Hwang,
Youn Kil Jung,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Hongjing Yang,
Jennifer C. Yee,
Doeon Kim,
Dong-Jin Kim,
Sang-Mok Cha,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge
, et al. (39 additional authors not shown)
Abstract:
The United Kingdom Infrared Telescope (UKIRT) microlensing survey was conducted over four years, from 2016 to 2019, with the goal of serving as a precursor to future near-infrared microlensing surveys (Shvartzvald et al. 2017). Focusing on stars in the Galactic center and utilizing near-infrared passbands, the survey identified approximately one thousand microlensing events, 27 of which displayed…
▽ More
The United Kingdom Infrared Telescope (UKIRT) microlensing survey was conducted over four years, from 2016 to 2019, with the goal of serving as a precursor to future near-infrared microlensing surveys (Shvartzvald et al. 2017). Focusing on stars in the Galactic center and utilizing near-infrared passbands, the survey identified approximately one thousand microlensing events, 27 of which displayed anomalies in their light curves (Wen et al. 2023). This paper presents an analysis of these anomalous events, aiming to uncover the underlying causes of the observed anomalies. The events were analyzed under various configurations, considering the potential binarity of both the lens and the source. For 11 events that were additionally observed by other optical microlensing surveys, including those conducted by the OGLE, KMTNet, and MOA collaborations, we incorporated their data into our analysis. Among the reported anomalous events, we revealed the nature of 24 events except for three events, in which one was likely to be a transient variable, and two were were difficult to accurately characterize their nature due to the limitations of the available data. We confirmed the binary lens nature of the anomalies in 22 events. Among these, we verified the earlier discovery that the companion in the binary lens system UKIRT11L is a planetary object. Accurately describing the anomaly in UKIRT21 required a model that accounted for the binarity of both the lens and the source. For two events UKIRT01 and UKIRT17, the anomalies could be interpreted using either a binary-source or a binary-lens model.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Testing Conditional Stochastic Dominance at Target Points
Authors:
Federico A. Bugni,
Ivan A. Canay,
Deborah Kim
Abstract:
This paper introduces a novel test for conditional stochastic dominance (CSD) at specific values of the conditioning covariates, referred to as target points. The test is relevant for analyzing income inequality, evaluating treatment effects, and studying discrimination. We propose a Kolmogorov--Smirnov-type test statistic that utilizes induced order statistics from independent samples. Notably, t…
▽ More
This paper introduces a novel test for conditional stochastic dominance (CSD) at specific values of the conditioning covariates, referred to as target points. The test is relevant for analyzing income inequality, evaluating treatment effects, and studying discrimination. We propose a Kolmogorov--Smirnov-type test statistic that utilizes induced order statistics from independent samples. Notably, the test features a data-independent critical value, eliminating the need for resampling techniques such as the bootstrap. Our approach avoids kernel smoothing and parametric assumptions, instead relying on a tuning parameter to select relevant observations. We establish the asymptotic properties of our test, showing that the induced order statistics converge to independent draws from the true conditional distributions and that the test is asymptotically of level $α$ under weak regularity conditions. While our results apply to both continuous and discrete data, in the discrete case, the critical value only provides a valid upper bound. To address this, we propose a refined critical value that significantly enhances power, requiring only knowledge of the support size of the distributions. Additionally, we analyze the test's behavior in the limit experiment, demonstrating that it reduces to a problem analogous to testing unconditional stochastic dominance in finite samples. This framework allows us to prove the validity of permutation-based tests for stochastic dominance when the random variables are continuous. Monte Carlo simulations confirm the strong finite-sample performance of our method.
△ Less
Submitted 20 April, 2025; v1 submitted 18 March, 2025;
originally announced March 2025.
-
Conversational Agents as Catalysts for Critical Thinking: Challenging Social Influence in Group Decision-making
Authors:
Soohwan Lee,
Seoyeong Hwang,
Dajung Kim,
Kyungho Lee
Abstract:
Group decision-making processes frequently suffer when social influence and power dynamics suppress minority viewpoints, leading to compliance and groupthink. Conversational agents can counteract these harmful dynamics by encouraging critical thinking. This study investigates how LLM-powered devil's advocate systems affect psychological safety, opinion expression, and satisfaction in power-imbalan…
▽ More
Group decision-making processes frequently suffer when social influence and power dynamics suppress minority viewpoints, leading to compliance and groupthink. Conversational agents can counteract these harmful dynamics by encouraging critical thinking. This study investigates how LLM-powered devil's advocate systems affect psychological safety, opinion expression, and satisfaction in power-imbalanced group dynamics. We conducted an experiment with 48 participants in 12 four-person groups, each containing three high-power (senior) and one low-power (junior) member. Each group completed decision tasks in both baseline and AI intervention conditions. Results show AI counterarguments fostered a more flexible atmosphere and significantly enhanced both process and outcome satisfaction for all participants, with particularly notable improvements for minority members. Cognitive workload increased slightly, though not significantly. This research contributes empirical evidence on how AI systems can effectively navigate power hierarchies to foster more inclusive decision-making environments, highlighting the importance of balancing intervention frequency, maintaining conversational flow, and preserving group cohesion.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation
Authors:
Donggon Jang,
Yucheol Cho,
Suin Lee,
Taehyeon Kim,
Dae-Shik Kim
Abstract:
The fusion of Large Language Models with vision models is pioneering new possibilities in user-interactive vision-language tasks. A notable application is reasoning segmentation, where models generate pixel-level segmentation masks by comprehending implicit meanings in human instructions. However, seamless human-AI interaction demands more than just object-level recognition; it requires understand…
▽ More
The fusion of Large Language Models with vision models is pioneering new possibilities in user-interactive vision-language tasks. A notable application is reasoning segmentation, where models generate pixel-level segmentation masks by comprehending implicit meanings in human instructions. However, seamless human-AI interaction demands more than just object-level recognition; it requires understanding both objects and the functions of their detailed parts, particularly in multi-target scenarios. For example, when instructing a robot to \textit{turn on the TV"}, there could be various ways to accomplish this command. Recognizing multiple objects capable of turning on the TV, such as the TV itself or a remote control (multi-target), provides more flexible options and aids in finding the optimized scenario. Furthermore, understanding specific parts of these objects, like the TV's button or the remote's button (part-level), is important for completing the action. Unfortunately, current reasoning segmentation datasets predominantly focus on a single target object-level reasoning, which limits the detailed recognition of an object's parts in multi-target contexts. To address this gap, we construct a large-scale dataset called Multi-target and Multi-granularity Reasoning (MMR). MMR comprises 194K complex and implicit instructions that consider multi-target, object-level, and part-level aspects, based on pre-existing image-mask sets. This dataset supports diverse and context-aware interactions by hierarchically providing object and part information. Moreover, we propose a straightforward yet effective framework for multi-target, object-level, and part-level reasoning segmentation. Experimental results on MMR show that the proposed method can reason effectively in multi-target and multi-granularity scenarios, while the existing reasoning segmentation model still has room for improvement.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Direct Detection of Fast-Moving Low-Mass Dark Matter
Authors:
Haider Alhazmi,
Doojin Kim,
Kyoungchul Kong,
Gopolang Mohlabeng,
Jong-Chul Park,
Seodong Shin
Abstract:
We examine the signals produced by dark matter interactions with electrons, which play a crucial role in direct detection experiments employing heavy target materials, particularly in many well-motivated sub-GeV dark matter scenarios. When the momentum transfer to target electrons is comparable to or exceeds their binding energy, atomic effects related to electron ionization become essential for a…
▽ More
We examine the signals produced by dark matter interactions with electrons, which play a crucial role in direct detection experiments employing heavy target materials, particularly in many well-motivated sub-GeV dark matter scenarios. When the momentum transfer to target electrons is comparable to or exceeds their binding energy, atomic effects related to electron ionization become essential for accurately determining signal rates - especially in the case of fast-moving dark matter. In this paper, we revisit and extend the atomic ionization formalism, systematically comparing different approaches used to formulate the ionization form factor and identifying their respective domains of validity. As practical applications, we explore detection prospects in xenon target experiments. To illustrate our findings, we consider a specific scenario involving boosted dark matter, which often leads to high-momentum electron recoils. Our analysis demonstrates that the choice of formalism can significantly influence the interpretation of experimental data, depending on the regions of parameter space.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
DAPI: Domain Adaptive Toxicity Probe Vector Intervention for Fine-Grained Detoxification
Authors:
Cho Hyeonsu,
Dooyoung Kim,
Youngjoong Ko
Abstract:
There have been attempts to utilize linear probe for detoxification, with existing studies relying on a single toxicity probe vector to reduce toxicity. However, toxicity can be fine-grained into various subcategories, making it difficult to remove certain types of toxicity by using a single toxicity probe vector. To address this limitation, we propose a category-specific toxicity probe vector app…
▽ More
There have been attempts to utilize linear probe for detoxification, with existing studies relying on a single toxicity probe vector to reduce toxicity. However, toxicity can be fine-grained into various subcategories, making it difficult to remove certain types of toxicity by using a single toxicity probe vector. To address this limitation, we propose a category-specific toxicity probe vector approach. First, we train multiple toxicity probe vectors for different toxicity categories. During generation, we dynamically select the most relevant toxicity probe vector based on the current context. Finally, the selected vector is dynamically scaled and subtracted from model. Our method successfully mitigated toxicity from categories that the single probe vector approach failed to detoxify. Experiments demonstrate that our approach achieves up to a 78.52% reduction in toxicity on the evaluation dataset, while fluency remains nearly unchanged, with only a 0.052% drop compared to the unsteered model.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
Authors:
In-Chang Baek,
Sung-Hyun Kim,
Seo-Young Lee,
Dong-Hyeon Kim,
Kyung-Joong Kim
Abstract:
Recent research has highlighted the significance of natural language in enhancing the controllability of generative models. While various efforts have been made to leverage natural language for content generation, research on deep reinforcement learning (DRL) agents utilizing text-based instructions for procedural content generation remains limited. In this paper, we propose IPCGRL, an instruction…
▽ More
Recent research has highlighted the significance of natural language in enhancing the controllability of generative models. While various efforts have been made to leverage natural language for content generation, research on deep reinforcement learning (DRL) agents utilizing text-based instructions for procedural content generation remains limited. In this paper, we propose IPCGRL, an instruction-based procedural content generation method via reinforcement learning, which incorporates a sentence embedding model. IPCGRL fine-tunes task-specific embedding representations to effectively compress game-level conditions. We evaluate IPCGRL in a two-dimensional level generation task and compare its performance with a general-purpose embedding method. The results indicate that IPCGRL achieves up to a 21.4% improvement in controllability and a 17.2% improvement in generalizability for unseen instructions. Furthermore, the proposed method extends the modality of conditional input, enabling a more flexible and expressive interaction framework for procedural content generation.
△ Less
Submitted 24 March, 2025; v1 submitted 16 March, 2025;
originally announced March 2025.
-
A 28 nm AI microcontroller with tightly coupled zero-standby power weight memory featuring standard logic compatible 4 Mb 4-bits/cell embedded flash technology
Authors:
Daewung Kim,
Seong Hwan Jeon,
Young Hee Jeon,
Kyung-Bae Kwon,
Jigon Kim,
Yeounghun Choi,
Hyunseung Cha,
Kitae Kwon,
Daesik Park,
Jongseuk Lee,
Sihwan Kim,
Seung-Hwan Song
Abstract:
This study introduces a novel AI microcontroller optimized for cost-effective, battery-powered edge AI applications. Unlike traditional single bit/cell memory configurations, the proposed microcontroller integrates zero-standby power weight memory featuring standard logic compatible 4-bits/cell embedded flash technology tightly coupled to a Near-Memory Computing Unit. This architecture enables eff…
▽ More
This study introduces a novel AI microcontroller optimized for cost-effective, battery-powered edge AI applications. Unlike traditional single bit/cell memory configurations, the proposed microcontroller integrates zero-standby power weight memory featuring standard logic compatible 4-bits/cell embedded flash technology tightly coupled to a Near-Memory Computing Unit. This architecture enables efficient and low-power AI acceleration. Advanced state mapping and an overstress-free word line (WL) driver circuit extend verify levels, ensuring robust 16 state cell margin. A ping-pong buffer reduces internal data movement while supporting simultaneous multi-bit processing. The fabricated microcontroller demonstrated high reliability, maintaining accuracy after 160 hours of unpowered baking at 125$^\circ$C.
△ Less
Submitted 12 February, 2025;
originally announced March 2025.
-
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
Authors:
Jeimin Jeon,
Youngmin Oh,
Junghyup Lee,
Donghyeon Baek,
Dohyung Kim,
Chanho Eom,
Bumsub Ham
Abstract:
N-shot neural architecture search (NAS) exploits a supernet containing all candidate subnets for a given search space. The subnets are typically trained with a static training strategy (e.g., using the same learning rate (LR) scheduler and optimizer for all subnets). This, however, does not consider that individual subnets have distinct characteristics, leading to two problems: (1) The supernet tr…
▽ More
N-shot neural architecture search (NAS) exploits a supernet containing all candidate subnets for a given search space. The subnets are typically trained with a static training strategy (e.g., using the same learning rate (LR) scheduler and optimizer for all subnets). This, however, does not consider that individual subnets have distinct characteristics, leading to two problems: (1) The supernet training is biased towards the low-complexity subnets (unfairness); (2) the momentum update in the supernet is noisy (noisy momentum). We present a dynamic supernet training technique to address these problems by adjusting the training strategy adaptive to the subnets. Specifically, we introduce a complexity-aware LR scheduler (CaLR) that controls the decay ratio of LR adaptive to the complexities of subnets, which alleviates the unfairness problem. We also present a momentum separation technique (MS). It groups the subnets with similar structural characteristics and uses a separate momentum for each group, avoiding the noisy momentum problem. Our approach can be applicable to various N-shot NAS methods with marginal cost, while improving the search performance drastically. We validate the effectiveness of our approach on various search spaces (e.g., NAS-Bench-201, Mobilenet spaces) and datasets (e.g., CIFAR-10/100, ImageNet).
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Spivey's type recurrence relation for Lah-Bell polynomials
Authors:
Taekyun Kim,
Dae San Kim
Abstract:
The aim of this paper is to derive Spivey's type recurrence relations for the Lah-Bell polynomials and the r-Lah-Bell polynomials by utilizing operators X and D satisfying the commutation relation DX-XD=1.
Here X is the `multiplication by x' operator and D is the differentiation operator D=d/dx. In addition, we obtain Spivey's type recurrence relation for the lambda analogue of r-Lah-Bell polyno…
▽ More
The aim of this paper is to derive Spivey's type recurrence relations for the Lah-Bell polynomials and the r-Lah-Bell polynomials by utilizing operators X and D satisfying the commutation relation DX-XD=1.
Here X is the `multiplication by x' operator and D is the differentiation operator D=d/dx. In addition, we obtain Spivey's type recurrence relation for the lambda analogue of r-Lah-Bell polynomials by some other method without using the operators X and D.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Fourier Decomposition for Explicit Representation of 3D Point Cloud Attributes
Authors:
Donghyun Kim,
Hyunah Ko,
Chanyoung Kim,
Seong Jae Hwang
Abstract:
While 3D point clouds are widely utilized across various vision applications, their irregular and sparse nature make them challenging to handle. In response, numerous encoding approaches have been proposed to capture the rich semantic information of point clouds. Yet, a critical limitation persists: a lack of consideration for colored point clouds which are more capable 3D representations as they…
▽ More
While 3D point clouds are widely utilized across various vision applications, their irregular and sparse nature make them challenging to handle. In response, numerous encoding approaches have been proposed to capture the rich semantic information of point clouds. Yet, a critical limitation persists: a lack of consideration for colored point clouds which are more capable 3D representations as they contain diverse attributes: color and geometry. While existing methods handle these attributes separately on a per-point basis, this leads to a limited receptive field and restricted ability to capture relationships across multiple points. To address this, we pioneer a point cloud encoding methodology that leverages 3D Fourier decomposition to disentangle color and geometric features while extending the receptive field through spectral-domain operations. Our analysis confirms that this encoding approach effectively separates feature components, where the amplitude uniquely captures color attributes and the phase encodes geometric structure, thereby enabling independent learning and utilization of both attributes. Furthermore, the spectral-domain properties of these components naturally aggregate local features while considering multiple points' information. We validate our point cloud encoding approach on point cloud classification and style transfer tasks, achieving state-of-the-art results on the DensePoint dataset with improvements via a proposed amplitude-based data augmentation strategy.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
JWST Spectroscopic View of a Rapidly Growing Dust-obscured Quasar at $z \sim 4$: Effect of Dust Extinction Correction on Black Hole Mass and Eddington Ratio Estimation
Authors:
Dohyeong Kim,
Myungshin Im
Abstract:
In the merger-driven galaxy evolution scenario, the central supermassive black holes (SMBHs) in dust obscured galaxies grow rapidly. Interestingly, a recent work \citep{suh24} on a dust-obscured galaxy, LID-568 at $z=3.965$, has shown that its SMBH is growing extremely fast at about 40 times of the Eddington-limited accretion rate (i.e., super-Eddington accretion). However, the heavy dust extincti…
▽ More
In the merger-driven galaxy evolution scenario, the central supermassive black holes (SMBHs) in dust obscured galaxies grow rapidly. Interestingly, a recent work \citep{suh24} on a dust-obscured galaxy, LID-568 at $z=3.965$, has shown that its SMBH is growing extremely fast at about 40 times of the Eddington-limited accretion rate (i.e., super-Eddington accretion). However, the heavy dust extinction of the host galaxy could affect the result if not corrected properly. Here, we analyze James Webb Space Telescope (JWST) NIRSpec/IFU and MIRI spectra of LID-568. By measuring its bolometric luminosity ($L_{\rm bol}$) and BH mass ($M_{\rm BH}$) using an extinction-free estimator based on mid-infrared spectra, we obtain $L_{\rm bol} = 10^{46.83\pm0.07}\,{\rm erg\,s^{-1}}$ and $M_{\rm BH} = 10^{8.43\pm0.15}\,M_{\odot}$. The measured Eddington ratio ($λ_{\rm Edd}$) is 1.97$\pm$0.88, consistent with the accretion rate at the Eddington limit; in other words, not in super-Eddington in a significant manner. This result underscores challenges and the importance of carefully considering dust extinction when analyzing the BH growth in dust-obscured quasars.
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
TelePix2: Full scale fast region of interest trigger and timing for the EUDET-style telescopes at the DESY II Test Beam Facility
Authors:
Lennart Huth,
Heiko Augustin,
Lucas Dittmann,
Sebastian Dittmeier,
Jan Hammerich,
Yajun He,
Adrian Herkert,
Dohun Kim,
Uwe Krämer,
David Maximillian Immig,
Ruben Kolb,
Ivan Perić,
Bernhard Pilsl,
Thomas Senger,
Sara Ruiz Daza,
Andre Schöning,
Marcel Stanitzki Benjamin Weinläder,
Arianna Wintle
Abstract:
With increasing demands by future and current upgrades of particle physics experiments on rate capabilities and time resolution, the requirements on test beams are also increasing. The current infrastructure at the DESY II test beam facility includes particle tracking telescopes with long integration times, no additional timing but excellent spatial resolution. This results in readouts with multip…
▽ More
With increasing demands by future and current upgrades of particle physics experiments on rate capabilities and time resolution, the requirements on test beams are also increasing. The current infrastructure at the DESY II test beam facility includes particle tracking telescopes with long integration times, no additional timing but excellent spatial resolution. This results in readouts with multiple particles per trigger, causing ambiguities in tracking and assigning particles to triggers. Also, it is likely not to trigger on particles that pass through a small device under test, leading to inefficient data taking. These issues can be solved by adding TelePix2 as a timing and flexible region of interest trigger layer. TelePix2 is a full scale HV-CMOS chip based on the successful small scale prototype TelePix. The DAQ system and the sensors performance featuring efficiencies above 99 % and a time resolution of 3.844(2) ns are presented. The integration into EUDAQ2 and the AIDA-TLU to seamlessly work in the test beam environment as well as into the analysis chain is described. First successful use cases are highlighted to conclude that TelePix2 is a well-suited timing and trigger layer for test beams
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols
Authors:
Yongwoo Kim,
Sungmin Cha,
Donghyun Kim
Abstract:
Machine unlearning is a process to remove specific data points from a trained model while maintaining the performance on retain data, addressing privacy or legal requirements. Despite its importance, existing unlearning evaluations tend to focus on logit-based metrics (i.e., accuracy) under small-scale scenarios. We observe that this could lead to a false sense of security in unlearning approaches…
▽ More
Machine unlearning is a process to remove specific data points from a trained model while maintaining the performance on retain data, addressing privacy or legal requirements. Despite its importance, existing unlearning evaluations tend to focus on logit-based metrics (i.e., accuracy) under small-scale scenarios. We observe that this could lead to a false sense of security in unlearning approaches under real-world scenarios. In this paper, we conduct a new comprehensive evaluation that employs representation-based evaluations of the unlearned model under large-scale scenarios to verify whether the unlearning approaches genuinely eliminate the targeted forget data from the model's representation perspective. Our analysis reveals that current state-of-the-art unlearning approaches either completely degrade the representational quality of the unlearned model or merely modify the classifier (i.e., the last layer), thereby achieving superior logit-based evaluation metrics while maintaining significant representational similarity to the original model. Furthermore, we introduce a rigorous unlearning evaluation setup, in which the forgetting classes exhibit semantic similarity to downstream task classes, necessitating that feature representations diverge significantly from those of the original model, thus enabling a more rigorous evaluation from a representation perspective. We hope our benchmark serves as a standardized protocol for evaluating unlearning algorithms under realistic conditions.
△ Less
Submitted 16 May, 2025; v1 submitted 10 March, 2025;
originally announced March 2025.
-
Probabilistic degenerate poly-Bell polynomials associated with random variables
Authors:
Pengxiang Xue,
Yuankui Ma,
Taekyun Kim,
Dae San Kim,
Wenpeng Zhang
Abstract:
Let Y be a random variable whose moment generating function exists in a neighborhood of the origin. The aim of this paper is to study the probabilistic degenerate poly-Bell polynomials associated with the random variable Y, arising from the degenerate polyexponential functions, which are probabilistic extensions of degenerate versions of the poly-Bell polynomials. We derive several explicit expres…
▽ More
Let Y be a random variable whose moment generating function exists in a neighborhood of the origin. The aim of this paper is to study the probabilistic degenerate poly-Bell polynomials associated with the random variable Y, arising from the degenerate polyexponential functions, which are probabilistic extensions of degenerate versions of the poly-Bell polynomials. We derive several explicit expressions and some related identities for them. In addition, we consider the special cases that Y is the Bernoulli random variable with probability of success p or the gamma random variable with parameters 1,1.
△ Less
Submitted 9 March, 2025;
originally announced March 2025.
-
Measurements and models of enhanced recombination following inner-shell vacancies in liquid xenon
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
E. E. Barillier,
D. Bauer,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger
, et al. (193 additional authors not shown)
Abstract:
Electron-capture decays of $^{125}$Xe and $^{127}$Xe, and double-electron-capture decays of $^{124}$Xe, are backgrounds in searches for weakly interacting massive particles (WIMPs) conducted by dual-phase xenon time projection chambers such as LUX-ZEPLIN (LZ). These decays produce signals with more light and less charge than equivalent-energy $β$ decays, and correspondingly overlap more with WIMP…
▽ More
Electron-capture decays of $^{125}$Xe and $^{127}$Xe, and double-electron-capture decays of $^{124}$Xe, are backgrounds in searches for weakly interacting massive particles (WIMPs) conducted by dual-phase xenon time projection chambers such as LUX-ZEPLIN (LZ). These decays produce signals with more light and less charge than equivalent-energy $β$ decays, and correspondingly overlap more with WIMP signals. We measure three electron-capture charge yields in LZ: the 1.1~keV M-shell, 5.2~keV L-shell, and 33.2~keV K-shell at drift fields of 193 and 96.5~V/cm. The LL double-electron-capture decay of $^{124}$Xe exhibits even more pronounced shifts in charge and light. We provide a first model of double-electron-capture charge yields using the link between ionization density and electron-ion recombination, and identify a need for more accurate calculations. Finally, we discuss the implications of the reduced charge yield of these decays and other interactions creating inner-shell vacancies for future dark matter searches.
△ Less
Submitted 17 June, 2025; v1 submitted 7 March, 2025;
originally announced March 2025.
-
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images
Authors:
Jungho Lee,
Donghyeong Kim,
Dogyoon Lee,
Suhwan Cho,
Minhyeok Lee,
Wonjoon Lee,
Taeoh Kim,
Dongyoon Wee,
Sangyoun Lee
Abstract:
3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges. A critical issue is the camera motion blur caused by movement during exposure, which hinders accurate 3D scene reconstruction. In this study, we propose CoMoGaussian, a Continuous Motion-Aware Gaussian Splatting that reconstructs precise 3…
▽ More
3D Gaussian Splatting (3DGS) has gained significant attention for their high-quality novel view rendering, motivating research to address real-world challenges. A critical issue is the camera motion blur caused by movement during exposure, which hinders accurate 3D scene reconstruction. In this study, we propose CoMoGaussian, a Continuous Motion-Aware Gaussian Splatting that reconstructs precise 3D scenes from motion-blurred images while maintaining real-time rendering speed. Considering the complex motion patterns inherent in real-world camera movements, we predict continuous camera trajectories using neural ordinary differential equations (ODEs). To ensure accurate modeling, we employ rigid body transformations, preserving the shape and size of the object but rely on the discrete integration of sampled frames. To better approximate the continuous nature of motion blur, we introduce a continuous motion refinement (CMR) transformation that refines rigid transformations by incorporating additional learnable parameters. By revisiting fundamental camera theory and leveraging advanced neural ODE techniques, we achieve precise modeling of continuous camera trajectories, leading to improved reconstruction accuracy. Extensive experiments demonstrate state-of-the-art performance both quantitatively and qualitatively on benchmark datasets, which include a wide range of motion blur scenarios, from moderate to extreme blur.
△ Less
Submitted 7 March, 2025;
originally announced March 2025.
-
Piccolo: Large-Scale Graph Processing with Fine-Grained In-Memory Scatter-Gather
Authors:
Changmin Shin,
Jaeyong Song,
Hongsun Jang,
Dogeun Kim,
Jun Sung,
Taehee Kwon,
Jae Hyung Ju,
Frank Liu,
Yeonkyu Choi,
Jinho Lee
Abstract:
Graph processing requires irregular, fine-grained random access patterns incompatible with contemporary off-chip memory architecture, leading to inefficient data access. This inefficiency makes graph processing an extremely memory-bound application. Because of this, existing graph processing accelerators typically employ a graph tiling-based or processing-in-memory (PIM) approach to relieve the me…
▽ More
Graph processing requires irregular, fine-grained random access patterns incompatible with contemporary off-chip memory architecture, leading to inefficient data access. This inefficiency makes graph processing an extremely memory-bound application. Because of this, existing graph processing accelerators typically employ a graph tiling-based or processing-in-memory (PIM) approach to relieve the memory bottleneck. In the tiling-based approach, a graph is split into chunks that fit within the on-chip cache to maximize data reuse. In the PIM approach, arithmetic units are placed within memory to perform operations such as reduction or atomic addition. However, both approaches have several limitations, especially when implemented on current memory standards (i.e., DDR). Because the access granularity provided by DDR is much larger than that of the graph vertex property data, much of the bandwidth and cache capacity are wasted. PIM is meant to alleviate such issues, but it is difficult to use in conjunction with the tiling-based approach, resulting in a significant disadvantage. Furthermore, placing arithmetic units inside a memory chip is expensive, thereby supporting multiple types of operation is thought to be impractical. To address the above limitations, we present Piccolo, an end-to-end efficient graph processing accelerator with fine-grained in-memory random scatter-gather. Instead of placing expensive arithmetic units in off-chip memory, Piccolo focuses on reducing the off-chip traffic with non-arithmetic function-in-memory of random scatter-gather. To fully benefit from in-memory scatter-gather, Piccolo redesigns the cache and MHA of the accelerator such that it can enjoy both the advantage of tiling and in-memory operations. Piccolo achieves a maximum speedup of 3.28$\times$ and a geometric mean speedup of 1.62$\times$ across various and extensive benchmarks.
△ Less
Submitted 9 March, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services
Authors:
Xiaoqi Wang,
Hongyang Du,
Yuehong Gao,
Dong In Kim
Abstract:
Recent advancements in large language models (LLMs) have led to their widespread adoption and large-scale deployment across various domains. However, their environmental impact, particularly during inference, has become a growing concern due to their substantial energy consumption and carbon footprint. Existing research has focused on inference computation alone, overlooking the analysis and optim…
▽ More
Recent advancements in large language models (LLMs) have led to their widespread adoption and large-scale deployment across various domains. However, their environmental impact, particularly during inference, has become a growing concern due to their substantial energy consumption and carbon footprint. Existing research has focused on inference computation alone, overlooking the analysis and optimization of carbon footprint in network-aided LLM service systems. To address this gap, we propose AOLO, a framework for analysis and optimization for low-carbon oriented wireless LLM services. AOLO introduces a comprehensive carbon footprint model that quantifies greenhouse gas emissions across the entire LLM service chain, including computational inference and wireless communication. Furthermore, we formulate an optimization problem aimed at minimizing the overall carbon footprint, which is solved through joint optimization of inference outputs and transmit power under quality-of-experience and system performance constraints. To achieve this joint optimization, we leverage the energy efficiency of spiking neural networks (SNNs) by adopting SNN as the actor network and propose a low-carbon-oriented optimization algorithm, i.e., SNN-based deep reinforcement learning (SDRL). Comprehensive simulations demonstrate that SDRL algorithm significantly reduces overall carbon footprint, achieving an 18.77% reduction compared to the benchmark soft actor-critic, highlighting its potential for enabling more sustainable LLM inference services.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Measurement of the Branching Fraction of $Λ_c^+ \to p K_S^0 π^0$ at Belle
Authors:
The Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
J. K. Ahn,
H. Aihara,
N. Akopov,
M. Alhakami,
A. Aloisio,
N. Althubiti,
M. Angelsmark,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati,
P. Bambade
, et al. (404 additional authors not shown)
Abstract:
We report a precise measurement of the ratio of branching fractions $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)$ using 980 fb$^{-1}$ of $e^+e^-$ data from the Belle experiment. We obtain a value of $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)=0.339\pm 0.002\pm 0.009$, where the first and second uncertainties are statistical and systematic, respectively.…
▽ More
We report a precise measurement of the ratio of branching fractions $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)$ using 980 fb$^{-1}$ of $e^+e^-$ data from the Belle experiment. We obtain a value of $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)=0.339\pm 0.002\pm 0.009$, where the first and second uncertainties are statistical and systematic, respectively. This Belle result is consistent with the previous measurement from the CLEO experiment but has a fivefold improvement in precision. By combining our result with the world average $\mathcal{B}(Λ_c^+\to p K^- π^+)$, we obtain the absolute branching fraction $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)=(2.12\pm 0.01\pm 0.05 \pm 0.10)\%$, where the uncertainties are statistical, systematic, and the uncertainty in the absolute branching fraction scale $\mathcal{B}(Λ_c^+\to p K^- π^+)$, respectively. This measurement can shed light on hadronic decay mechanisms in charmed baryon decays.
△ Less
Submitted 18 March, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
Quantum interference and occupation control in high harmonic generation from monolayer $WS_2$
Authors:
Minjeong Kim,
Taeho Kim,
Anna Galler,
Dasol Kim,
Alexis Chacon,
Xiangxin Gong,
Yuhui Yang,
Rouli Fang,
Kenji Watanabe,
Takashi Taniguchi,
B. J. Kim,
Sang Hoon Chae,
Moon-Ho Jo,
Angel Rubio,
Ofer Neufeld,
Jonghwan Kim
Abstract:
Two-dimensional hexagonal materials such as transition metal dichalcogenides exhibit valley degrees of freedom, offering fascinating potential for valley-based quantum computing and optoelectronics. In nonlinear optics, the K and K' valleys provide excitation resonances that can be used for ultrafast control of excitons, Bloch oscillations, and Floquet physics. Under intense laser fields, however,…
▽ More
Two-dimensional hexagonal materials such as transition metal dichalcogenides exhibit valley degrees of freedom, offering fascinating potential for valley-based quantum computing and optoelectronics. In nonlinear optics, the K and K' valleys provide excitation resonances that can be used for ultrafast control of excitons, Bloch oscillations, and Floquet physics. Under intense laser fields, however, the role of coherent carrier dynamics away from the K/K' valleys is largely unexplored. In this study, we observe quantum interferences in high harmonic generation from monolayer $WS_2$ as laser fields drive electrons from the valleys across the full Brillouin zone. In the perturbative regime, interband resonances at the valleys enhance high harmonic generation through multi-photon excitations. In the strong-field regime, the high harmonic spectrum is sensitively controlled by light-driven quantum interferences between the interband valley resonances and intraband currents originating from electrons occupying various points in the Brillouin zone, also away from K/K' valleys such as $Γ$ and M. Our experimental observations are in strong agreement with quantum simulations, validating their interpretation. This work proposes new routes for harnessing laser-driven quantum interference in two-dimensional hexagonal systems and all-optical techniques to occupy and read-out electronic structures in the full Brillouin zone via strong-field nonlinear optics, advancing quantum technologies.
△ Less
Submitted 9 March, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
Recurrence relations for degenerate Bell and Dowling polynomials via Boson operators
Authors:
Taekyun Kim,
Dae San Kim
Abstract:
Spivey found a recurrence relation for the Bell numbers by using combinatorial method. The aim of this paper is to derive Spivey's type recurrence relations for the degenerate Bell polynomials and the degenerate Dowling polynomials by using the boson annihilation and creation operators satisfying the commutation relation aa+-a+a=1.
In addition, we derive a Spivey's type recurrence relation for t…
▽ More
Spivey found a recurrence relation for the Bell numbers by using combinatorial method. The aim of this paper is to derive Spivey's type recurrence relations for the degenerate Bell polynomials and the degenerate Dowling polynomials by using the boson annihilation and creation operators satisfying the commutation relation aa+-a+a=1.
In addition, we derive a Spivey's type recurrence relation for the r-Dowling polynomials.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
NASA Innovative Advanced Concepts Phase I Final Report -- A Lunar Long-Baseline UV/Optical Imaging Interferometer: Artemis-enabled Stellar Imager (AeSI)
Authors:
Kenneth G. Carpenter,
Tabetha Boyajian,
Derek Buzasi,
Jim Clark,
Michelle Creech-Eakman,
Bruce Dean,
Ashley Elliott,
Julianne Foster,
Qian Gong,
Margarita Karovska,
David Kim,
Jon Hulberg,
David Leisawitz,
Mike Maher,
Jon Morse,
Dave Mozurkewich,
Sarah Peacock,
Noah Petro,
Gioia Rau,
Paul Scowen,
Len Seals,
Walter Smith,
Max Smuda,
Breann Sitarski,
Buddy Taylor
, et al. (2 additional authors not shown)
Abstract:
This report presents the findings of a NIAC Phase I feasibility study for the Artemis-enabled Stellar Imager (AeSI), a proposed high-resolution, UV/Optical interferometer designed for deployment on the lunar surface. Its primary science goal is to image the surfaces and interiors of stars with unprecedented detail, revealing new details about their magnetic processes and dynamic evolution and enab…
▽ More
This report presents the findings of a NIAC Phase I feasibility study for the Artemis-enabled Stellar Imager (AeSI), a proposed high-resolution, UV/Optical interferometer designed for deployment on the lunar surface. Its primary science goal is to image the surfaces and interiors of stars with unprecedented detail, revealing new details about their magnetic processes and dynamic evolution and enabling the creation of a truly predictive solar/stellar dynamo model. This capability will transform our understanding of stellar physics and has broad applicability across astrophysics, from resolving the cores of Active Galactic Nuclei (AGN) to studying supernovae, planetary nebulae, and the late stages of stellar evolution. By leveraging the stable vacuum environment of the Moon and the infrastructure being established for the Artemis Program, AeSI presents a compelling case for a lunar-based interferometer. In this study, the AeSI Team, working with the NASA Goddard Space Flight Center's Integrated Design Center (IDC), has firmly established the feasibility of building and operating a reconfigurable, dispersed aperture telescope (i.e., an interferometer) on the lunar surface. The collaboration produced a credible Baseline design featuring 15 primary mirrors arranged in an elliptical array with a 1 km major axis, with the potential to expand to 30 mirrors and larger array sizes through staged deployments. Additionally, this study identified numerous opportunities for optimization and the necessary trade studies to refine the design further. These will be pursued in follow-up investigations, such as a NIAC Phase II study, to advance the concept toward implementation.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Authors:
Microsoft,
:,
Abdelrahman Abouelenin,
Atabak Ashfaq,
Adam Atkinson,
Hany Awadalla,
Nguyen Bach,
Jianmin Bao,
Alon Benhaim,
Martin Cai,
Vishrav Chaudhary,
Congcong Chen,
Dong Chen,
Dongdong Chen,
Junkun Chen,
Weizhu Chen,
Yen-Chun Chen,
Yi-ling Chen,
Qi Dai,
Xiyang Dai,
Ruchao Fan,
Mei Gao,
Min Gao,
Amit Garg,
Abhishek Goswami
, et al. (51 additional authors not shown)
Abstract:
We introduce Phi-4-Mini and Phi-4-Multimodal, compact yet highly capable language and multimodal models. Phi-4-Mini is a 3.8-billion-parameter language model trained on high-quality web and synthetic data, significantly outperforming recent open-source models of similar size and matching the performance of models twice its size on math and coding tasks requiring complex reasoning. This achievement…
▽ More
We introduce Phi-4-Mini and Phi-4-Multimodal, compact yet highly capable language and multimodal models. Phi-4-Mini is a 3.8-billion-parameter language model trained on high-quality web and synthetic data, significantly outperforming recent open-source models of similar size and matching the performance of models twice its size on math and coding tasks requiring complex reasoning. This achievement is driven by a carefully curated synthetic data recipe emphasizing high-quality math and coding datasets. Compared to its predecessor, Phi-3.5-Mini, Phi-4-Mini features an expanded vocabulary size of 200K tokens to better support multilingual applications, as well as group query attention for more efficient long-sequence generation. Phi-4-Multimodal is a multimodal model that integrates text, vision, and speech/audio input modalities into a single model. Its novel modality extension approach leverages LoRA adapters and modality-specific routers to allow multiple inference modes combining various modalities without interference. For example, it now ranks first in the OpenASR leaderboard to date, although the LoRA component of the speech/audio modality has just 460 million parameters. Phi-4-Multimodal supports scenarios involving (vision + language), (vision + speech), and (speech/audio) inputs, outperforming larger vision-language and speech-language models on a wide range of tasks. Additionally, we experiment to further train Phi-4-Mini to enhance its reasoning capabilities. Despite its compact 3.8-billion-parameter size, this experimental version achieves reasoning performance on par with or surpassing significantly larger models, including DeepSeek-R1-Distill-Qwen-7B and DeepSeek-R1-Distill-Llama-8B.
△ Less
Submitted 7 March, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
CoPL: Collaborative Preference Learning for Personalizing LLMs
Authors:
Youngbin Choi,
Seunghyuk Cho,
Minjong Lee,
MoonJeong Park,
Yesong Ko,
Jungseul Ok,
Dongwoo Kim
Abstract:
Personalizing large language models (LLMs) is important for aligning outputs with diverse user preferences, yet existing methods struggle with flexibility and generalization. We propose CoPL (Collaborative Preference Learning), a graph-based collaborative filtering framework that models user-response relationships to enhance preference estimation, particularly in sparse annotation settings. By int…
▽ More
Personalizing large language models (LLMs) is important for aligning outputs with diverse user preferences, yet existing methods struggle with flexibility and generalization. We propose CoPL (Collaborative Preference Learning), a graph-based collaborative filtering framework that models user-response relationships to enhance preference estimation, particularly in sparse annotation settings. By integrating a mixture of LoRA experts, CoPL efficiently fine-tunes LLMs while dynamically balancing shared and user-specific preferences. Additionally, an optimization-free adaptation strategy enables generalization to unseen users without fine-tuning. Experiments on UltraFeedback-P demonstrate that CoPL outperforms existing personalized reward models, effectively capturing both common and controversial preferences, making it a scalable solution for personalized LLM alignment.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
First X-ray polarimetric view of a Low-Luminosity Active Galactic Nucleus: the case of NGC 2110
Authors:
Sudip Chakraborty,
Ajay Ratheesh,
Daniele Tagliacozzo,
Philip Kaaret,
Jakub Podgorný,
Frédéric Marin,
Francesco Tombesi,
Steven R. Ehlert,
Chien-Ting J. Chen,
Dawoon E. Kim,
Ioannis Liodakis,
Francesco Ursini,
Riccardo Middei,
Alessandro Di Marco,
Fabio La Monaca,
Srimanta Banerjee,
Keigo Fukumura,
W. Peter Maksym,
Romana Mikušincová,
Rodrigo Nemmen,
Pierre-Olivier Petrucci,
Paolo Soffitta,
Jiří Svoboda
Abstract:
Low-Luminosity Active Galactic Nuclei (LLAGN) provides a unique view of Comptonization and non-thermal emission from accreting black holes in the low-accretion rate regime. However, to decipher the exact nature of the Comptonizing corona in LLAGN, its geometry and emission mechanism must be understood beyond the limits of spectro-timing techniques. Spectro-polarimetry offers the potential to break…
▽ More
Low-Luminosity Active Galactic Nuclei (LLAGN) provides a unique view of Comptonization and non-thermal emission from accreting black holes in the low-accretion rate regime. However, to decipher the exact nature of the Comptonizing corona in LLAGN, its geometry and emission mechanism must be understood beyond the limits of spectro-timing techniques. Spectro-polarimetry offers the potential to break the degeneracies between different coronal emission models. Compton-thin LLAGN provide an opportunity for such spectro-polarimetric exploration in the 2-8 keV energy range using IXPE. In this work, we carry out a spectro-polarimetric analysis of the first IXPE observation, in synergy with a contemporaneous NuSTAR observation, of an LLAGN: NGC 2110. Using 554.4 ks of IXPE data from October 2024, we constrain the 99% upper limit on the Polarization Degree (PD) to be less than 8.3% assuming the corresponding Polarization Angle (PA) to be aligned with the radio jet, and less than 3.6% if in the perpendicular direction. In the absence of a significant PD detection, the PA remains formally unconstrained, yet the polarization significance contours appear to be aligned with the radio jet, tentatively supporting models in which the corona is radially extended in the plane of the disk. We also carry out detailed Monte Carlo simulations using MONK and STOKES codes to test different coronal models against our results and compare the polarization properties between NGC 2110 and brighter Seyferts.
△ Less
Submitted 2 March, 2025;
originally announced March 2025.
-
VTuber's Atelier: The Design Space, Challenges, and Opportunities for VTubing
Authors:
Daye Kim,
Sebin Lee,
Yoonseo Jun,
Yujin Shin,
Jungjin Lee
Abstract:
VTubing, the practice of live streaming using virtual avatars, has gained worldwide popularity among streamers seeking to maintain anonymity. While previous research has primarily focused on the social and cultural aspects of VTubing, there is a noticeable lack of studies examining the practical challenges VTubers face in creating and operating their avatars. To address this gap, we surveyed VTube…
▽ More
VTubing, the practice of live streaming using virtual avatars, has gained worldwide popularity among streamers seeking to maintain anonymity. While previous research has primarily focused on the social and cultural aspects of VTubing, there is a noticeable lack of studies examining the practical challenges VTubers face in creating and operating their avatars. To address this gap, we surveyed VTubers' equipment and expanded the live-streaming design space by introducing six new dimensions related to avatar creation and control. Additionally, we conducted interviews with 16 professional VTubers to comprehensively explore their practices, strategies, and challenges throughout the VTubing process. Our findings reveal that VTubers face significant burdens compared to real-person streamers due to fragmented tools and the multi-tasking nature of VTubing, leading to unique workarounds. Finally, we summarize these challenges and propose design opportunities to improve the effectiveness and efficiency of VTubing.
△ Less
Submitted 2 March, 2025;
originally announced March 2025.
-
Aerial Secure Collaborative Communications under Eavesdropper Collusion in Low-altitude Economy: A Generative Swarm Intelligent Approach
Authors:
Jiahui Li,
Geng Sun,
Qingqing Wu,
Shuang Liang,
Jiacheng Wang,
Dusit Niyato,
Dong In Kim
Abstract:
In this work, we aim to introduce distributed collaborative beamforming (DCB) into AAV swarms and handle the eavesdropper collusion by controlling the corresponding signal distributions. Specifically, we consider a two-way DCB-enabled aerial communication between two AAV swarms and construct these swarms as two AAV virtual antenna arrays. Then, we minimize the two-way known secrecy capacity and ma…
▽ More
In this work, we aim to introduce distributed collaborative beamforming (DCB) into AAV swarms and handle the eavesdropper collusion by controlling the corresponding signal distributions. Specifically, we consider a two-way DCB-enabled aerial communication between two AAV swarms and construct these swarms as two AAV virtual antenna arrays. Then, we minimize the two-way known secrecy capacity and maximum sidelobe level to avoid information leakage from the known and unknown eavesdroppers, respectively. Simultaneously, we also minimize the energy consumption of AAVs when constructing virtual antenna arrays. Due to the conflicting relationships between secure performance and energy efficiency, we consider these objectives by formulating a multi-objective optimization problem, which is NP-hard and with a large number of decision variables. Accordingly, we design a novel generative swarm intelligence (GenSI) framework to solve the problem with less overhead, which contains a conditional variational autoencoder (CVAE)-based generative method and a proposed powerful swarm intelligence algorithm. In this framework, CVAE can collect expert solutions obtained by the swarm intelligence algorithm in other environment states to explore characteristics and patterns, thereby directly generating high-quality initial solutions in new environment factors for the swarm intelligence algorithm to search solution space efficiently. Simulation results show that the proposed swarm intelligence algorithm outperforms other state-of-the-art baseline algorithms, and the GenSI can achieve similar optimization results by using far fewer iterations than the ordinary swarm intelligence algorithm. Experimental tests demonstrate that introducing the CVAE mechanism achieves a 58.7% reduction in execution time, which enables the deployment of GenSI even on AAV platforms with limited computing power.
△ Less
Submitted 1 March, 2025;
originally announced March 2025.
-
KatFishNet: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis
Authors:
Shinwoo Park,
Shubin Kim,
Do-Kyung Kim,
Yo-Sub Han
Abstract:
The rapid advancement of large language models (LLMs) increases the difficulty of distinguishing between human-written and LLM-generated text. Detecting LLM-generated text is crucial for upholding academic integrity, preventing plagiarism, protecting copyrights, and ensuring ethical research practices. Most prior studies on detecting LLM-generated text focus primarily on English text. However, lan…
▽ More
The rapid advancement of large language models (LLMs) increases the difficulty of distinguishing between human-written and LLM-generated text. Detecting LLM-generated text is crucial for upholding academic integrity, preventing plagiarism, protecting copyrights, and ensuring ethical research practices. Most prior studies on detecting LLM-generated text focus primarily on English text. However, languages with distinct morphological and syntactic characteristics require specialized detection approaches. Their unique structures and usage patterns can hinder the direct application of methods primarily designed for English. Among such languages, we focus on Korean, which has relatively flexible spacing rules, a rich morphological system, and less frequent comma usage compared to English. We introduce KatFish, the first benchmark dataset for detecting LLM-generated Korean text. The dataset consists of text written by humans and generated by four LLMs across three genres.
By examining spacing patterns, part-of-speech diversity, and comma usage, we illuminate the linguistic differences between human-written and LLM-generated Korean text. Building on these observations, we propose KatFishNet, a detection method specifically designed for the Korean language. KatFishNet achieves an average of 19.78% higher AUROC compared to the best-performing existing detection method. Our code and data are available at https://github.com/Shinwoo-Park/detecting_llm_generated_korean_text_through_linguistic_analysis.
△ Less
Submitted 1 July, 2025; v1 submitted 24 February, 2025;
originally announced March 2025.
-
Differentially private synthesis of Spatial Point Processes
Authors:
Dangchan Kim,
Chae Young Lim
Abstract:
This paper proposes a method to generate synthetic data for spatial point patterns within the differential privacy (DP) framework. Specifically, we define a differentially private Poisson point synthesizer (PPS) and Cox point synthesizer (CPS) to generate synthetic point patterns with the concept of the $α$-neighborhood that relaxes the original definition of DP. We present three example models to…
▽ More
This paper proposes a method to generate synthetic data for spatial point patterns within the differential privacy (DP) framework. Specifically, we define a differentially private Poisson point synthesizer (PPS) and Cox point synthesizer (CPS) to generate synthetic point patterns with the concept of the $α$-neighborhood that relaxes the original definition of DP. We present three example models to construct a differentially private PPS and CPS, providing sufficient conditions on their parameters to ensure the DP given a specified privacy budget. In addition, we demonstrate that the synthesizers can be applied to point patterns on the linear network. Simulation experiments demonstrate that the proposed approaches effectively maintain the privacy and utility of synthetic data.
△ Less
Submitted 25 February, 2025;
originally announced February 2025.