Skip to main content

Showing 1–50 of 4,668 results for author: Park, J

.
  1. arXiv:2507.07533  [pdf

    cond-mat.supr-con cond-mat.mes-hall cond-mat.str-el

    Dark states of electrons in a quantum system with two pairs of sublattices

    Authors: Yoonah Chung, Minsu Kim, Yeryn Kim, Seyeong Cha, Joon Woo Park, Jeehong Park, Yeonjin Yi, Dongjoon Song, Jung Hyun Ryu, Kimoon Lee, Timur K. Kim, Cephise Cacho, Jonathan Denlinger, Chris Jozwiak, Eli Rotenberg, Aaron Bostwick, Keun Su Kim

    Abstract: A quantum state of matter that is forbidden to interact with photons and is therefore undetectable by spectroscopic means is called a dark state. This basic concept can be applied to condensed matter where it suggests that a whole band of quantum states could be undetectable across a full Brillouin zone. Here we report the discovery of such condensed matter dark states in palladium diselenide as a… ▽ More

    Submitted 10 July, 2025; originally announced July 2025.

    Journal ref: Nature Physics 20, 1582-1588 (2024)

  2. arXiv:2507.07476  [pdf, ps, other

    hep-ph hep-ex

    A comparative study of physics capabilities of a liquid argon and a water based liquid scintillator at DUNE

    Authors: Nishat Fiza, Suhyeon Kim, Emar Masaku, Mehedi Masud, Hokyeong Nam, Juseong Park, Yujin Park, Kim Siyeon

    Abstract: We present a comprehensive comparison of the physics sensitivities of a Liquid Argon Time Projection Chamber (LArTPC) and a Water-based Liquid Scintillator (WbLS) detector, considering their potential deployment as the fourth far detector module in the DUNE facility. Using GLoBES-based simulations, we evaluate their performance in measuring standard neutrino oscillation parameters (… ▽ More

    Submitted 10 July, 2025; originally announced July 2025.

    Comments: 25 pages, 9 figures, 2 tables

  3. arXiv:2507.07147  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.CV

    Weighted Multi-Prompt Learning with Description-free Large Language Model Distillation

    Authors: Sua Lee, Kyubum Shin, Jung Ho Park

    Abstract: Recent advances in pre-trained Vision Language Models (VLM) have shown promising potential for effectively adapting to downstream tasks through prompt learning, without the need for additional annotated paired datasets. To supplement the text information in VLM trained on correlations with vision data, new approaches leveraging Large Language Models (LLM) in prompts have been proposed, enhancing r… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: Published as a conference paper at ICLR 2025

  4. arXiv:2507.06785  [pdf, ps, other

    stat.ME stat.AP

    Bayesian Bootstrap-based Gaussian Copula Model for Mixed Data with High Missing Rates

    Authors: Seongmin Kim, Jeunghun Oh, Hungkuk Ko, Jeongmin Park, Jaeyong Lee

    Abstract: Missing data is a common issue in various fields such as medicine, social sciences, and natural sciences, and it poses significant challenges for accurate statistical analysis. Although numerous imputation methods have been proposed to address this issue, many of them fail to adequately capture the complex dependency structure among variables. To overcome this limitation, models based on the Gauss… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 29 pages, 1 figure, 4 tables

  5. arXiv:2507.06782  [pdf, ps, other

    cs.IR cs.AI cs.LG

    Temporal Information Retrieval via Time-Specifier Model Merging

    Authors: SeungYoon Han, Taeho Hwang, Sukmin Cho, Soyeong Jeong, Hoyun Song, Huije Lee, Jong C. Park

    Abstract: The rapid expansion of digital information and knowledge across structured and unstructured sources has heightened the importance of Information Retrieval (IR). While dense retrieval methods have substantially improved semantic matching for general queries, they consistently underperform on queries with explicit temporal constraints--often those containing numerical expressions and time specifiers… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

  6. arXiv:2507.06754  [pdf, ps, other

    math.NT math.AG math.KT

    Counting isomorphism classes of elliptic curves over $\mathbb{F}_q(t)$

    Authors: Jun-Yong Park

    Abstract: We determine the precise number of isomorphism classes of elliptic curves over $\mathbb{F}_q(t)$ with $\text{char}(\mathbb{F}_q) = 3,2$. The key idea is to obtain the exact unweighted number of rational points on the classifying stacks $\mathcal{B} Q_{12}$, $\mathcal{B} Q_{24}$ and $\mathcal{B} Z$, where $Q_{12}$ and $Q_{24}$ denote the dicyclic groups of orders 12 and 24, respectively, and $Z$ de… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 13 pages; Comments very welcome!

  7. arXiv:2507.06543  [pdf, ps, other

    cs.CV

    Token Bottleneck: One Token to Remember Dynamics

    Authors: Taekyung Kim, Dongyoon Han, Byeongho Heo, Jeongeun Park, Sangdoo Yun

    Abstract: Deriving compact and temporally aware visual representations from dynamic scenes is essential for successful execution of sequential scene understanding tasks such as visual tracking and robotic manipulation. In this paper, we introduce Token Bottleneck (ToBo), a simple yet intuitive self-supervised learning pipeline that squeezes a scene into a bottleneck token and predicts the subsequent scene u… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 17 pages, 9 figures, 8 tables, project page: https://token-bottleneck.github.io, code: https://github.com/naver-ai/tobo

  8. arXiv:2507.06371  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.str-el physics.optics

    Terahertz field-induced metastable magnetization near criticality in FePS3

    Authors: Batyr Ilyas, Tianchuang Luo, Alexander von Hoegen, Emil Viñas Boström, Zhuquan Zhang, Jaena Park, Junghyun Kim, Je-Geun Park, Keith A. Nelson, Angel Rubio, Nuh Gedik

    Abstract: Controlling the functional properties of quantum materials with light has emerged as a frontier of condensed-matter physics, leading to the discovery of various light-induced phases of matter, such as superconductivity, ferroelectricity, magnetism and charge density waves. However, in most cases, the photoinduced phases return to equilibrium on ultrafast timescales after the light is turned off, l… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: 33 pages, 4 figures

    Journal ref: Nature 636 (2024) 609-614

  9. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  10. arXiv:2507.06233  [pdf, ps, other

    cs.CV

    Learning to Track Any Points from Human Motion

    Authors: Inès Hyeonsu Kim, Seokju Cho, Jahyeok Koo, Junghyun Park, Jiahui Huang, Joon-Young Lee, Seungryong Kim

    Abstract: Human motion, with its inherent complexities, such as non-rigid deformations, articulated movements, clothing distortions, and frequent occlusions caused by limbs or other individuals, provides a rich and challenging source of supervision that is crucial for training robust and generalizable point trackers. Despite the suitability of human motion, acquiring extensive training data for point tracki… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: Project Page: https://cvlab-kaist.github.io/AnthroTAP/

  11. arXiv:2507.06133  [pdf, ps, other

    cs.CE

    Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions

    Authors: Jaewan Park, Farid Ahmed, Kazuma Kobayashi, Seid Koric, Syed Bahauddin Alam, Iwona Jasiuk, Diab Abueidda

    Abstract: Video-diffusion models have recently set the standard in video generation, inpainting, and domain translation thanks to their training stability and high perceptual fidelity. Building on these strengths, we repurpose conditional video diffusion as a physics surrogate for spatio-temporal fields governed by partial differential equations (PDEs). Our two-stage surrogate first applies a Sequential Dee… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  12. arXiv:2507.06101  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Reference compositions for bismuth telluride thermoelectric materials for low-temperature power generation

    Authors: Nirma Kumari, Jaywan Chung, Seunghyun Oh, Jeongin Jang, Jongho Park, Ji Hui Son, SuDong Park, Byungki Ryu

    Abstract: Thermoelectric (TE) technology enables direct heat-to-electricity conversion and is gaining attention as a clean, fuel-saving, and carbon-neutral solution for industrial, automotive, and marine applications. Despite nearly a century of research, apart from successes in deep-space power sources and solid-state cooling modules, the industrialization and commercialization of TE power generation remai… ▽ More

    Submitted 9 July, 2025; v1 submitted 8 July, 2025; originally announced July 2025.

    Comments: 45 pages, 4 tables, 14 figures (DOI info added for future activation upon publication. Error updated for k_ph)

  13. arXiv:2507.06049  [pdf, ps, other

    stat.ME stat.AP

    FDR controlling procedures with dimension reduction and their application to GWAS with linkage disequilibrium score

    Authors: Dayeon Jung, Yewon Kim, Junyong Park

    Abstract: Genome-wide association studies (GWAS) have led to the discovery of numerous single nucleotide polymorphisms (SNPs) associated with various phenotypes and complex diseases. However, the identified genetic variants do not fully explain the heritability of complex traits, known as the missing heritability problem. To address this challenge and accurately control false positives while maximizing true… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  14. arXiv:2507.05822  [pdf, ps, other

    cs.CV

    Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models

    Authors: L'ea Dubois, Klaus Schmidt, Chengyu Wang, Ji-Hoon Park, Lin Wang, Santiago Munoz

    Abstract: Current video understanding models excel at recognizing "what" is happening but fall short in high-level cognitive tasks like causal reasoning and future prediction, a limitation rooted in their lack of commonsense world knowledge. To bridge this cognitive gap, we propose a novel framework that synergistically fuses a powerful Vision Foundation Model (VFM) for deep visual perception with a Large L… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: 22 pages, 4 figures

    MSC Class: CS ACM Class: I.2.10

  15. arXiv:2507.05673  [pdf, ps, other

    cs.CV

    R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding

    Authors: Joonhyung Park, Peng Tang, Sagnik Das, Srikar Appalaraju, Kunwar Yashraj Singh, R. Manmatha, Shabnam Ghadar

    Abstract: Visual agent models for automating human activities on Graphical User Interfaces (GUIs) have emerged as a promising research direction, driven by advances in large Vision Language Models (VLMs). A critical challenge in GUI automation is the precise grounding of interface elements across diverse platforms. Existing vision-only GUI agents directly ground elements from large and cluttered screenshots… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: ACL 2025; 17 pages

  16. arXiv:2507.05585  [pdf, ps, other

    math.PR

    Capacity of the range of random walk: Moderate deviations in dimensions 4 and 5

    Authors: Arka Adhikari, Jiyun Park

    Abstract: We prove a moderate deviation principle for the capacity of the range of random walk in $\mathbb{Z}^5$. Depending on the scale of deviation, we get two different regimes. We observe Gaussian tails when the deviation scale is smaller than $n^{1/2} (\log n)^{3/4}$. Otherwise, we get non-Gaussian tails with a constant arising from a generalized Gagliardo-Nirenberg inequality. This is analogous to the… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 33 pages

    MSC Class: 60F10; 60G50

  17. arXiv:2507.05094  [pdf, ps, other

    hep-ex

    Observation of the decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$

    Authors: Belle, Belle II Collaborations, :, M. Abumusabh, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati , et al. (364 additional authors not shown)

    Abstract: We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Report number: Belle II Preprint 2025-019, KEK Preprint 2025-18

  18. arXiv:2507.05050  [pdf, ps, other

    hep-ex

    Measurement of the $ D^{0}\rightarrow K^{-}π^{+}e^{+}e^{-} $ branching fraction and search for $ D^{0}\rightarrow π^{+}π^{-}e^{+}e^{-} $ and $D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $ decays at Belle

    Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae , et al. (458 additional authors not shown)

    Abstract: We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fr… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Report number: Belle II Preprint 2025-020; KEK Preprint 2025-19

  19. arXiv:2507.04896  [pdf, ps, other

    hep-ex

    Cross sections of $η$ mesons in $p$$+$$p$ collisions at forward rapidity at $\sqrt{s}=500$ GeV and central rapidity at $\sqrt{s}=510$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Ta'ani, J. Alexander, M. Alfred, D. Anderson, K. R. Andrews, A. Angerami, S. Antsupov, K. Aoki, N. Apadula, E. Appelt, Y. Aramaki, R. Armendariz, H. Asano, E. C. Aschenauer, E. T. Atomssa, T. C. Awes, B. Azmoun , et al. (476 additional authors not shown)

    Abstract: We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross sectio… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 500 authors from 81 institutions, 14 pages, 7 figures, 3 tables. v1 is version submitted to Physical Review D. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  20. arXiv:2507.04482  [pdf, ps, other

    cs.CV

    A Training-Free Style-Personalization via Scale-wise Autoregressive Model

    Authors: Kyoungmin Lee, Jihun Park, Jongmin Gim, Wonhyeok Choi, Kyumin Hwang, Jaeyeul Kim, Sunghoon Im

    Abstract: We present a training-free framework for style-personalized image generation that controls content and style information during inference using a scale-wise autoregressive model. Our method employs a three-path design--content, style, and generation--each guided by a corresponding text prompt, enabling flexible and efficient control over image semantics without any additional training. A central c… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 13 pages, 10 figures

  21. arXiv:2507.04463  [pdf, ps, other

    nucl-ex

    Low-mass vector-meson production at forward rapidity in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, M. Alfred, D. Anderson, V. Andrieux, S. Antsupov, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, E. Bannikov, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont , et al. (331 additional authors not shown)

    Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nuc… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 356 authors from 71 institutions, 14 pages, 14 figures, 1 table. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  22. arXiv:2507.04157  [pdf, ps, other

    physics.optics

    Hyperspectral Dual-Comb Compressive Imaging for Minimally-Invasive Video-Rate Endomicroscopy

    Authors: Myoung-Gyun Suh, David Dang, Maodong Gao, Yucheng Jin, Byoung Jun Park, Beyonce Hu, Wilton J. M. Kort-Kamp, Ho Wai, Lee

    Abstract: Endoscopic imaging is essential for real-time visualization of internal organs, yet conventional systems remain bulky, complex, and expensive due to their reliance on large, multi-element optical components. This limits their accessibility to delicate or constrained anatomical regions. Achieving real-time, high-resolution endomicroscopy using compact, low-cost hardware at the hundred-micron scale… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

  23. Handling Korean Out-of-Vocabulary Words with Phoneme Representation Learning

    Authors: Nayeon Kim, Eojin Jeon, Jun-Hyung Park, SangKeun Lee

    Abstract: In this study, we introduce KOPL, a novel framework for handling Korean OOV words with Phoneme representation Learning. Our work is based on the linguistic property of Korean as a phonemic script, the high correlation between phonemes and letters. KOPL incorporates phoneme and word representations for Korean OOV words, facilitating Korean OOV word representations to capture both text and phoneme i… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Journal ref: Advances in Knowledge Discovery and Data Mining. PAKDD 2025

  24. arXiv:2507.03660  [pdf, ps, other

    cs.LG

    When Network Architecture Meets Physics: Deep Operator Learning for Coupled Multiphysics

    Authors: Kazuma Kobayashi, Jaewan Park, Qibang Liu, Seid Koric, Diab Abueidda, Syed Bahauddin Alam

    Abstract: Scientific applications increasingly demand real-time surrogate models that can capture the behavior of strongly coupled multiphysics systems driven by multiple input functions, such as in thermo-mechanical and electro-thermal processes. While neural operator frameworks, such as Deep Operator Networks (DeepONets), have shown considerable success in single-physics settings, their extension to multi… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

  25. arXiv:2507.03603  [pdf, ps, other

    nucl-ex

    Selection bias effects on high-$p_\mathrm{T}$ yield and correlation measurements in Oxygen+Oxygen collisions

    Authors: JaeBeom Park, J. L. Nagle, Dennis V. Perepelitsa, Sanghoon Lim, Constantin Loizides

    Abstract: Oxygen+Oxygen (O+O) collisions at RHIC and the LHC offer a unique experimental opportunity to observe the onset of jet quenching in intermediate relativistic collision systems. As with the smaller proton-nucleus or larger nucleus-nucleus systems, measurements of centrality-selected high-$p_\mathrm{T}$ processes in O+O collisions are expected to be sensitive to selection bias effects, which will be… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

    Comments: 9 pages, 12 figures, comments welcome before journal submission

  26. arXiv:2507.03192  [pdf, ps, other

    math.NA

    Parallel multilevel methods for solving the Darcy--Forchheimer model based on a nearly semicoercive formulation

    Authors: Jongho Park, S. Majid Hassanizadeh

    Abstract: High-velocity fluid flow through porous media is modeled by prescribing a nonlinear relationship between the flow rate and the pressure gradient, called Darcy--Forchheimer equation. This paper is concerned with the analysis of parallel multilevel methods for solving the Darcy--Forchheimer model. We begin by reformulating the Darcy--Forchheimer model as a nearly semicoercive convex optimization pro… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 21 pages, 3 figures

    MSC Class: 65N55; 65N20; 76S05; 90C25

  27. arXiv:2507.03114  [pdf, ps, other

    cs.DC

    Characterizing Compute-Communication Overlap in GPU-Accelerated Distributed Deep Learning: Performance and Power Implications

    Authors: Seonho Lee, Jihwan Oh, Junkyum Kim, Seokjin Go, Jongse Park, Divya Mahajan

    Abstract: This paper provides an in-depth characterization of GPU-accelerated systems, to understand the interplay between overlapping computation and communication which is commonly employed in distributed training settings. Due to the large size of models, distributing them across multiple devices is required. Overlapping strategies, which enable concurrent computation and communication, are critical for… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  28. Enhancing Multi-Exposure High Dynamic Range Imaging with Overlapped Codebook for Improved Representation Learning

    Authors: Keuntek Lee, Jaehyun Park, Nam Ik Cho

    Abstract: High dynamic range (HDR) imaging technique aims to create realistic HDR images from low dynamic range (LDR) inputs. Specifically, Multi-exposure HDR imaging uses multiple LDR frames taken from the same scene to improve reconstruction performance. However, there are often discrepancies in motion among the frames, and different exposure settings for each capture can lead to saturated regions. In thi… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: Accepted to International Conference on Pattern Recognition. Springer, Cham, 2025 (ICPR 2024)

  29. arXiv:2507.01496  [pdf, ps, other

    cs.CV

    ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation

    Authors: Jimyeong Kim, Jungwon Park, Yeji Song, Nojun Kwak, Wonjong Rhee

    Abstract: Rectified Flow text-to-image models surpass diffusion models in image quality and text alignment, but adapting ReFlow for real-image editing remains challenging. We propose a new real-image editing method for ReFlow by analyzing the intermediate representations of multimodal transformer blocks and identifying three key features. To extract these features from real images with sufficient structural… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: Published at ICCV 2025. Project page: https://wlaud1001.github.io/ReFlex/

  30. arXiv:2507.01415  [pdf, ps, other

    math.OC math.NA

    Randomized subspace correction methods for convex optimization

    Authors: Boou Jiang, Jongho Park, Jinchao Xu

    Abstract: This paper introduces an abstract framework for randomized subspace correction methods for convex optimization, which unifies and generalizes a broad class of existing algorithms, including domain decomposition, multigrid, and block coordinate descent methods. We provide a convergence rate analysis ranging from minimal assumptions to more practical settings, such as sharpness and strong convexity.… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 21 pages, 0 figures

    MSC Class: 90C25; 65N55; 65J05; 90C06

  31. arXiv:2507.01249  [pdf, ps, other

    hep-ex

    Search for an Axion-Like Particle in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ Decays at Belle

    Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae , et al. (400 additional authors not shown)

    Abstract: We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the dec… ▽ More

    Submitted 3 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: 26 pages, 15 Figures

    Report number: Belle II Preprint: 2025-017 KEK Preprint: 2025-16

  32. arXiv:2507.00726  [pdf, ps, other

    cs.AI cs.LG

    Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess

    Authors: Dongyoon Hwang, Hojoon Lee, Jaegul Choo, Dongmin Park, Jongho Park

    Abstract: While reinforcement learning (RL) for large language models (LLMs) has shown promise in mathematical reasoning, strategic reasoning for LLMs using RL remains largely unexplored. We investigate whether LLMs can develop strategic reasoning capabilities through RL in chess. To this end, we leverage a chess-pretrained action-value network to provide dense reward on the LLM's output move quality, which… ▽ More

    Submitted 2 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: 27 pages

  33. arXiv:2507.00480  [pdf, ps, other

    cs.LG stat.ML

    Posterior Inference in Latent Space for Scalable Constrained Black-box Optimization

    Authors: Kiyoung Om, Kyuil Sim, Taeyoung Yun, Hyeongyu Kang, Jinkyoo Park

    Abstract: Optimizing high-dimensional black-box functions under black-box constraints is a pervasive task in a wide range of scientific and engineering problems. These problems are typically harder than unconstrained problems due to hard-to-find feasible regions. While Bayesian optimization (BO) methods have been developed to solve such problems, they often struggle with the curse of dimensionality. Recentl… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 25 pages, 11 figures, 5 tables. Equal contribution by Kiyoung Om, Kyuil Sim, and Taeyoung Yun

  34. arXiv:2507.00198  [pdf, ps, other

    cs.HC

    Exploring AR Label Placements in Visually Cluttered Scenarios

    Authors: Ji Hwan Park, Braden Roper, Amirhossein Arezoumand, Tien Tran

    Abstract: We investigate methods for placing labels in AR environments that have visually cluttered scenes. As the number of items increases in a scene within the user' FOV, it is challenging to effectively place labels based on existing label placement guidelines. To address this issue, we implemented three label placement techniques for in-view objects for AR applications. We specifically target a scenari… ▽ More

    Submitted 30 June, 2025; originally announced July 2025.

  35. arXiv:2506.23552  [pdf, ps, other

    cs.CV cs.SD eess.AS

    JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching

    Authors: Mingi Kwon, Joonghyuk Shin, Jaeseok Jung, Jaesik Park, Youngjung Uh

    Abstract: The intrinsic link between facial motion and speech is often overlooked in generative modeling, where talking head synthesis and text-to-speech (TTS) are typically addressed as separate tasks. This paper introduces JAM-Flow, a unified framework to simultaneously synthesize and condition on both facial motion and speech. Our approach leverages flow matching and a novel Multi-Modal Diffusion Transfo… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: project page: https://joonghyuk.com/jamflow-web Under review. Preprint published on arXiv

  36. arXiv:2506.23530  [pdf, ps, other

    physics.plasm-ph

    Investigation of resonant layer response in electron viscosity regime

    Authors: Yeongsun Lee, Jace Waybright, Jong-Kyu Park

    Abstract: We present a supplementary study of previous work in Waybright and Park [Phys. Plasmas 31, 022502 (2024)] which demonstrates a substantial effect of electron viscosity on the resonant layer response to non-axisymmetric magnetic perturbations. A main refinement is to include a curl element of electron viscosity in the generalized Ohm's law. The refinement reveals a resonant layer response in the El… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

  37. arXiv:2506.23529  [pdf, ps, other

    cs.CV cs.LG

    When Test-Time Adaptation Meets Self-Supervised Models

    Authors: Jisu Han, Jihee Park, Dongyoon Han, Wonjun Hwang

    Abstract: Training on test-time data enables deep learning models to adapt to dynamic environmental changes, enhancing their practical applicability. Online adaptation from source to target domains is promising but it remains highly reliant on the performance of source pretrained model. In this paper, we investigate whether test-time adaptation (TTA) methods can continuously improve models trained via self-… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: 15 pages, 7 figures

  38. arXiv:2506.23518  [pdf, ps, other

    cs.CV

    WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

    Authors: Jiwoo Park, Tae Eun Choi, Youngjun Jun, Seong Jae Hwang

    Abstract: Generating high-quality novel views of a scene from a single image requires maintaining structural coherence across different views, referred to as view consistency. While diffusion models have driven advancements in novel view synthesis, they still struggle to preserve spatial continuity across views. Diffusion models have been combined with 3D models to address the issue, but such approaches lac… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

  39. arXiv:2506.22694  [pdf, ps, other

    cs.CL

    VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs

    Authors: Raghavv Goel, Sudhanshu Agrawal, Mukul Gagrani, Junyoung Park, Yifan Zao, He Zhang, Tian Liu, Yiping Yang, Xin Yuan, Jiuyan Lu, Chris Lott, Mingu Lee

    Abstract: In this paper, we introduce a simple training-free technique to improve the performance of drafter-based speculative decoding (SpD) methods that incorporates language modeling head (LM head) during drafting process. A drafter-based speculative decoding leverages one or more smaller language models, a.k.a. drafters or draft models, to sample a draft sequence or tree consisting of multiple tokens, f… ▽ More

    Submitted 3 July, 2025; v1 submitted 27 June, 2025; originally announced June 2025.

    Comments: 8 pages, 4 figures, 5 tables, accepted at ICML 2025 workshop on Efficient Systems for Foundational Models

  40. arXiv:2506.21944  [pdf, ps, other

    physics.soc-ph

    Ranking dynamics in movies and music

    Authors: Hyun-Woo Lee, Gerardo Iñiguez, Hang-Hyun Jo, Hye Jin Park

    Abstract: Ranking systems are widely used to simplify and interpret complex data across diverse domains, from economic indicators and sports scores to online content popularity. While previous studies including the Zipf's law have focused on the static, aggregated properties of ranks, in recent years researchers have begun to uncover generic features in their temporal dynamics. In this work, we introduce an… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  41. arXiv:2506.21896  [pdf, ps, other

    cs.HC

    Focus on the Experts: Co-designing an Augmented Reality Eye-Gaze Tracking System with Surgical Trainees to Improve Endoscopic Instruction

    Authors: Jumanh Atoum, Jinkyung Park, Mamtaj Akter, Nicholas Kavoussi, Pamela Wisniewski, Jie Ying Wu

    Abstract: The current apprenticeship model for surgical training requires a high level of supervision, which does not scale well to meet the growing need for more surgeons. Many endoscopic procedures are directly taught in the operating room (OR) while the attending surgeon and trainee operate on patients. The need to prioritize patient care limits the trainees' opportunities to experiment and receive feedb… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  42. arXiv:2506.21595  [pdf, ps, other

    cs.CL

    Thunder-LLM: Efficiently Adapting LLMs to Korean with Minimal Resources

    Authors: Jinpyo Kim, Gyeongje Cho, Chanwoo Park, Jongwon Park, Jongmin Kim, Yeonkyoun So, Jaejin Lee

    Abstract: Since state-of-the-art LLMs often underperform in languages other than English or Chinese, improving the capability of LLMs in new languages has become an essential task. Moreover, LLMs' entire end-to-end training process remains largely unknown to the public due to proprietary reasons, technical complexity, inconsistent documentation, and ethical considerations. The complete picture remains a clo… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: Submitted to ARR 2025 May cycle

  43. arXiv:2506.21556  [pdf, ps, other

    cs.CL

    VAT-KG: Knowledge-Intensive Multimodal Knowledge Graph Dataset for Retrieval-Augmented Generation

    Authors: Hyeongcheol Park, MinHyuk Jang, Ha Dam Baek, Gyusam Chang, Jiyoung Seo, Jiwan Park, Hogun Park, Sangpil Kim

    Abstract: Multimodal Knowledge Graphs (MMKGs), which represent explicit knowledge across multiple modalities, play a pivotal role by complementing the implicit knowledge of Multimodal Large Language Models (MLLMs) and enabling more grounded reasoning via Retrieval Augmented Generation (RAG). However, existing MMKGs are generally limited in scope: they are often constructed by augmenting pre-existing knowled… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Project Page: https://vatkg.github.io/

  44. arXiv:2506.21174  [pdf

    eess.AS cs.LG

    Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4

    Authors: Jongyeon Park, Joonhee Lee, Do-Hyeon Lim, Hong Kook Kim, Hyeongcheol Geum, Jeong Eun Lim

    Abstract: This technical report presents submission systems for Task 4 of the DCASE 2025 Challenge. This model incorporates additional audio features (spectral roll-off and chroma features) into the embedding feature extracted from the mel-spectral feature to im-prove the classification capabilities of an audio-tagging model in the spatial semantic segmentation of sound scenes (S5) system. This approach is… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: DCASE 2025 challenge Task4, 5 pages

  45. arXiv:2506.21143  [pdf, ps, other

    hep-th

    $\mathbf{O}(D,D)$-Symmetric Box Operator and $α^{\prime}$-Corrections with Riemann Curvature

    Authors: Kawon Lee, Jeong-Hyuck Park

    Abstract: Within the framework of Double Field Theory, we construct an $\mathbf{O}(D,D)$-symmetric d'Alembertian, or box operator, that is applicable to tensors of arbitrary rank. Parameterized by the Riemannian metric and the $B$-field, the operator naturally incorporates the Riemann curvature tensor and the $H$-flux. When applied to the massless string sector, it produces a consistent stringy wave equatio… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 7+8 pages

  46. arXiv:2506.21021  [pdf, ps, other

    gr-qc astro-ph.IM

    Identification of Noise-Associated Glitches in KAGRA O3GK with Hveto

    Authors: T. Akutsu, M. Ando, M. Aoumi, A. Araya, Y. Aso, L. Baiotti, R. Bajpai, K. Cannon, A. H. -Y. Chen, D. Chen, H. Chen, A. Chiba, C. Chou, M. Eisenmann, K. Endo, T. Fujimori, S. Garg, D. Haba, S. Haino, R. Harada, H. Hayakawa, K. Hayama, S. Fujii, Y. Himemoto, N. Hirata , et al. (127 additional authors not shown)

    Abstract: Transient noise ("glitches") in gravitational wave detectors can mimic or obscure true signals, significantly reducing detection sensitivity. Identifying and excluding glitch-contaminated data segments is therefore crucial for enhancing the performance of gravitational-wave searches. We perform a noise analysis of the KAGRA data obtained during the O3GK observation. Our analysis is performed with… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: To appear in Progress of Theoretical and Experimental Physics (PTEP), accepted June 2025

  47. arXiv:2506.19697  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

    Authors: Jungwoo Park, Taewhoo Lee, Chanwoong Yoon, Hyeon Hwang, Jaewoo Kang

    Abstract: Extreme activation outliers in Large Language Models (LLMs) critically degrade quantization performance, hindering efficient on-device deployment. While channel-wise operations and adaptive gradient scaling are recognized causes, practical mitigation remains challenging. We introduce Outlier-Safe Pre-Training (OSP), a practical guideline that proactively prevents outlier formation rather than rely… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  48. arXiv:2506.19451  [pdf, ps, other

    eess.SP cs.LG

    Low-Complexity Semantic Packet Aggregation for Token Communication via Lookahead Search

    Authors: Seunghun Lee, Jihong Park, Jinho Choi, Hyuncheol Park

    Abstract: Tokens are fundamental processing units of generative AI (GenAI) and large language models (LLMs), and token communication (TC) is essential for enabling remote AI-generate content (AIGC) and wireless LLM applications. Unlike traditional bits, each of which is independently treated, the semantics of each token depends on its surrounding context tokens. This inter-token dependency makes TC vulnerab… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  49. arXiv:2506.19389  [pdf, ps, other

    cs.CV

    Emergence of Text Readability in Vision Language Models

    Authors: Jaeyoo Park, Sanghyuk Chun, Wonjae Kim, Sangdoo Yun, Bohyung Han

    Abstract: We investigate how the ability to recognize textual content within images emerges during the training of Vision-Language Models (VLMs). Our analysis reveals a critical phenomenon: the ability to read textual information in a given image \textbf{(text readability)} emerges abruptly after substantial training iterations, in contrast to semantic content understanding which develops gradually from the… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: EVAL-FoMo Workshop @ CVPR 2025

  50. arXiv:2506.19144  [pdf, ps, other

    stat.ML cs.LG

    Posterior Contraction for Sparse Neural Networks in Besov Spaces with Intrinsic Dimensionality

    Authors: Kyeongwon Lee, Lizhen Lin, Jaewoo Park, Seonghyun Jeong

    Abstract: This work establishes that sparse Bayesian neural networks achieve optimal posterior contraction rates over anisotropic Besov spaces and their hierarchical compositions. These structures reflect the intrinsic dimensionality of the underlying function, thereby mitigating the curse of dimensionality. Our analysis shows that Bayesian neural networks equipped with either sparse or continuous shrinkage… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.