
Showing 1–50 of 10,762 results for author: Chen, J

  1. arXiv:2506.15533  [pdf, ps, other]

    hep-ex

    Measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector, we present improved measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$ with the double-tag method. The statistical significance of each signal decay exceeds $10σ$. The bra…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 20 pages, 4 figures

  2. arXiv:2506.15524  [pdf, ps, other]

    cs.CV

    NTIRE 2025 Image Shadow Removal Challenge Report

    Authors: Florin-Alexandru Vasluianu, Tim Seizinger, Zhuyun Zhou, Cailian Chen, Zongwei Wu, Radu Timofte, Mingjia Li, Jin Hu, Hainuo Wang, Hengxing Liu, Jiarui Wang, Qiming Hu, Xiaojie Guo, Xin Lu, Jiarong Yang, Yuanfei Bao, Anya Hu, Zihao Fan, Kunyu Wang, Jie Xiao, Xi Wang, Xueyang Fu, Zheng-Jun Zha, Yu-Fan Lin, Chia-Ming Lee , et al. (57 additional authors not shown)

    Abstract: This work examines the findings of the NTIRE 2025 Shadow Removal Challenge. A total of 306 participants have registered, with 17 teams successfully submitting their solutions during the final evaluation phase. Following the last two editions, this challenge had two evaluation tracks: one focusing on reconstruction fidelity and the other on visual perception through a user study. Both tracks were e…

    Submitted 18 June, 2025; originally announced June 2025.

  3. arXiv:2506.15447  [pdf, ps, other]

    eess.SY cs.RO

    Model Predictive Path-Following Control for a Quadrotor

    Authors: David Leprich, Mario Rosenfelder, Mario Hermle, Jingshan Chen, Peter Eberhard

    Abstract: Automating drone-assisted processes is a complex task. Many solutions rely on trajectory generation and tracking, whereas in contrast, path-following control is a particularly promising approach, offering an intuitive and natural approach to automate tasks for drones and other vehicles. While different solutions to the path-following problem have been proposed, most of them lack the capability to…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 15 pages, 11 figures, submitted to PAMM 2025

    MSC Class: 93-XX

  4. arXiv:2506.15242  [pdf, ps, other]

    cs.CV

    RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories

    Authors: Qingsong Yan, Qiang Wang, Kaiyong Zhao, Jie Chen, Bo Li, Xiaowen Chu, Fei Deng

    Abstract: Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) have emerged as powerful tools for 3D reconstruction and SLAM tasks. However, their performance depends heavily on accurate camera pose priors. Existing approaches attempt to address this issue by introducing external constraints but fall short of achieving satisfactory accuracy, particularly when camera trajectories are complex. In th…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: IROS 2025

  5. arXiv:2506.15120  [pdf, ps, other]

    cs.IR cs.AI cs.LG

    Advancing Loss Functions in Recommender Systems: A Comparative Study with a Rényi Divergence-Based Solution

    Authors: Shengjia Zhang, Jiawei Chen, Changdong Li, Sheng Zhou, Qihao Shi, Yan Feng, Chun Chen, Can Wang

    Abstract: Loss functions play a pivotal role in optimizing recommendation models. Among various loss functions, Softmax Loss (SL) and Cosine Contrastive Loss (CCL) are particularly effective. Their theoretical connections and differences warrant in-depth exploration. This work conducts comprehensive analyses of these losses, yielding significant insights: 1) Common strengths -- both can be viewed as augment…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: AAAI 2025

  6. arXiv:2506.15087  [pdf, ps, other]

    cs.RO

    3D Vision-tactile Reconstruction from Infrared and Visible Images for Robotic Fine-grained Tactile Perception

    Authors: Yuankai Lin, Xiaofan Lu, Jiahui Chen, Hua Yang

    Abstract: To achieve human-like haptic perception in anthropomorphic grippers, the compliant sensing surfaces of vision tactile sensor (VTS) must evolve from conventional planar configurations to biomimetically curved topographies with continuous surface gradients. However, planar VTSs have challenges when extended to curved surfaces, including insufficient lighting of surfaces, blurring in reconstruction,…

    Submitted 17 June, 2025; originally announced June 2025.

  7. arXiv:2506.15050  [pdf, ps, other]

    cs.AI

    Truncated Proximal Policy Optimization

    Authors: Tiantian Fan, Lingjun Liu, Yu Yue, Jiaze Chen, Chengyi Wang, Qiying Yu, Chi Zhang, Zhiqi Lin, Ruofei Zhu, Yufeng Yuan, Xiaochen Zuo, Bole Ma, Mofan Zhang, Gaohong Liu, Ru Zhang, Haotian Zhou, Cong Xie, Ruidong Zhu, Zhi Zhang, Xin Liu, Mingxuan Wang, Lin Yan, Yonghui Wu

    Abstract: Recently, test-time scaling Large Language Models (LLMs) have demonstrated exceptional reasoning capabilities across scientific and professional tasks by generating long chains-of-thought (CoT). As a crucial component for developing these reasoning models, reinforcement learning (RL), exemplified by Proximal Policy Optimization (PPO) and its variants, allows models to learn through trial and error…

    Submitted 17 June, 2025; originally announced June 2025.

  8. arXiv:2506.14916  [pdf, ps, other]

    math.NA

    Interpolation-based reproducing kernel particle method

    Authors: Jennifer E. Fromm, John A. Evans, J. S. Chen

    Abstract: Meshfree methods, including the reproducing kernel particle method (RKPM), have been widely used within the computational mechanics community to model physical phenomena in materials undergoing large deformations or extreme topology changes. RKPM shape functions and their derivatives cannot be accurately integrated with the Gauss-quadrature methods widely employed for the finite element method (FE…

    Submitted 17 June, 2025; originally announced June 2025.

  9. arXiv:2506.14696  [pdf, ps, other]

    cs.CV

    YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework

    Authors: Dahang Wan, Rongsheng Lu, Yang Fang, Xianli Lang, Shuangbao Shu, Jingjing Chen, Siyuan Shen, Ting Xu, Zecong Ye

    Abstract: Multispectral object detection, which integrates information from multiple bands, can enhance detection accuracy and environmental adaptability, holding great application potential across various fields. Although existing methods have made progress in cross-modal interaction, low-light conditions, and model lightweight, there are still challenges like the lack of a unified single-stage framework,…

    Submitted 18 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: 29 pages, 8 figures. The errors in the first version have been corrected, and no new version will be submitted in the near future. The next version will include more experiments

  10. arXiv:2506.14549  [pdf, ps, other]

    cs.CV

    DreamLight: Towards Harmonious and Consistent Image Relighting

    Authors: Yong Liu, Wenpeng Xiao, Qianqian Wang, Junlin Chen, Shiyin Wang, Yitong Wang, Xinglong Wu, Yansong Tang

    Abstract: We introduce a model named DreamLight for universal image relighting in this work, which can seamlessly composite subjects into a new background while maintaining aesthetic uniformity in terms of lighting and color tone. The background can be specified by natural images (image-based relighting) or generated from unlimited text prompts (text-based relighting). Existing studies primarily focus on im…

    Submitted 17 June, 2025; originally announced June 2025.

  11. arXiv:2506.14477  [pdf, ps, other]

    cs.AI

    GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies

    Authors: Jingqi Yang, Zhilong Song, Jiawei Chen, Mingli Song, Sheng Zhou, linjun sun, Xiaogang Ouyang, Chun Chen, Can Wang

    Abstract: The development of high-quality datasets is crucial for benchmarking and advancing research in Graphical User Interface (GUI) agents. Despite their importance, existing datasets are often constructed under idealized conditions, overlooking the diverse anomalies frequently encountered in real-world deployments. To address this limitation, we introduce GUI-Robust, a novel dataset designed for compre…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 10 pages, 4 figures, submitted to NIPS 2025

  12. arXiv:2506.14418  [pdf, ps, other]

    cs.CV cs.AI

    Compositional Attribute Imbalance in Vision Datasets

    Authors: Jiayi Chen, Yanbiao Ma, Andi Zhang, Weidong Tang, Wei Dai, Bowei Liu

    Abstract: Visual attribute imbalance is a common yet underexplored issue in image classification, significantly impacting model performance and generalization. In this work, we first define the first-level and second-level attributes of images and then introduce a CLIP-based framework to construct a visual attribute dictionary, enabling automatic evaluation of image attributes. By systematically analyzing b…

    Submitted 17 June, 2025; originally announced June 2025.

  13. arXiv:2506.14346  [pdf, ps, other]

    astro-ph.GA astro-ph.SR

    Measurements of the Diffuse Interstellar Bands at 5780, 5797, and 6614 Å in the Hot Stellar Spectra of the LAMOST LRS DR10

    Authors: Xiao-Xiao Ma, A-Li Luo, Jian-Jun Chen, Jing Chen, Jun-Chao Liang

    Abstract: Diffuse Interstellar Bands (DIBs) are crucial tracers of the interstellar medium (ISM), yet their carriers remain poorly understood. While large-scale surveys have advanced DIB studies in cool stellar spectra, measurements in hot stellar spectra are still limited. Using 287 277 high signal-to-noise (S/N $>$ 50) hot stellar spectra from the tenth data release of the Large Sky Area Multi-Object Fibe…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 13 pages, 3 tables, 9 figures, accepted for publication in RAA

    Report number: RAA-2025-0122

  14. arXiv:2506.14224  [pdf, ps, other]

    cs.AI

    From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models

    Authors: Xinyang Li, Siqi Liu, Bochao Zou, Jiansheng Chen, Huimin Ma

    Abstract: As large language models evolve, there is growing anticipation that they will emulate human-like Theory of Mind (ToM) to assist with routine tasks. However, existing methods for evaluating machine ToM focus primarily on unimodal models and largely treat these models as black boxes, lacking an interpretative exploration of their internal mechanisms. In response, this study adopts an approach based…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 24 pages, 22 figures, accepted at ICML 2025, project page: see https://annaisavailable.github.io/GridToM/

  15. arXiv:2506.14168  [pdf, ps, other]

    cs.CV cs.AI

    VideoMAR: Autoregressive Video Generation with Continuous Tokens

    Authors: Hu Yu, Biao Gong, Hangjie Yuan, DanDan Zheng, Weilong Chai, Jingdong Chen, Kecheng Zheng, Feng Zhao

    Abstract: Masked-based autoregressive models have demonstrated promising image generation capability in continuous space. However, their potential for video generation remains under-explored. In this paper, we propose VideoMAR, a concise and efficient decoder-only autoregressive image-to-video model with continuous tokens, composing temporal frame-by-frame and spatial masked generation. We first id…

    Submitted 18 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

  16. arXiv:2506.14098  [pdf, ps, other]

    cs.LG cs.AI

    Toward a Graph Foundation Model: Pre-Training Transformers With Random Walks

    Authors: Ziyuan Tang, Jie Chen

    Abstract: A foundation model like GPT elicits many emergent abilities, owing to the pre-training with broad inclusion of data and the use of the powerful Transformer architecture. While foundation models in natural languages are prevalent, can we build similar models for graphs? This paper describes an approach toward a graph foundation model that is pre-trained with diverse graph datasets by adapting the T…

    Submitted 16 June, 2025; originally announced June 2025.

  17. arXiv:2506.13955  [pdf, ps, other]

    stat.ML cs.CR cs.LG stat.AP

    Bridging Unsupervised and Semi-Supervised Anomaly Detection: A Theoretically-Grounded and Practical Framework with Synthetic Anomalies

    Authors: Matthew Lau, Tian-Yi Zhou, Xiangchi Yuan, Jizhou Chen, Wenke Lee, Xiaoming Huo

    Abstract: Anomaly detection (AD) is a critical task across domains such as cybersecurity and healthcare. In the unsupervised setting, an effective and theoretically-grounded principle is to train classifiers to distinguish normal data from (synthetic) anomalies. We extend this principle to semi-supervised AD, where training data also include a limited labeled subset of anomalies possibly present in test tim…

    Submitted 16 June, 2025; originally announced June 2025.

  18. arXiv:2506.13898  [pdf, ps, other]

    quant-ph cond-mat.quant-gas

    Dynamical quantum phase transition with divergent multipartite entanglement

    Authors: Jie Chen, Ricardo Costa de Almeida, Hendrik Weimer

    Abstract: We investigate the nonequilibrium quench dynamics of the one-dimensional transverse-field Ising model in both integrable and nonintegrable regimes. In particular, we report on a novel type of dynamical quantum phase transition (DQPT) that is characterized by a divergent multipartite entanglement at critical times in the post-quench dynamics. We quantify the multipartite entanglement of the state b…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 7 pages, 6 figures

  19. arXiv:2506.13782  [pdf, ps, other]

    cs.IR cs.AI

    XGraphRAG: Interactive Visual Analysis for Graph-based Retrieval-Augmented Generation

    Authors: Ke Wang, Bo Pan, Yingchaojie Feng, Yuwei Wu, Jieyi Chen, Minfeng Zhu, Wei Chen

    Abstract: Graph-based Retrieval-Augmented Generation (RAG) has shown great capability in enhancing Large Language Model (LLM)'s answer with an external knowledge base. Compared to traditional RAG, it introduces a graph as an intermediate representation to capture better structured relational knowledge in the corpus, elevating the precision and comprehensiveness of generation results. However, developers usu…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted to IEEE Pacific Visualization Conference 2025

  20. arXiv:2506.13725  [pdf, ps, other]

    cs.RO

    CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding

    Authors: Wenxuan Song, Jiayi Chen, Pengxiang Ding, Yuxin Huang, Han Zhao, Donglin Wang, Haoang Li

    Abstract: In recent years, Vision-Language-Action (VLA) models have become a vital research direction in robotics due to their impressive multimodal understanding and generalization capabilities. Despite the progress, their practical deployment is severely constrained by inference speed bottlenecks, particularly in high-frequency and dexterous manipulation tasks. While recent studies have explored Jacobi de…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 16 pages

  21. arXiv:2506.13695  [pdf, ps, other]

    cs.IR

    OneRec Technical Report

    Authors: Guorui Zhou, Jiaxin Deng, Jinghao Zhang, Kuo Cai, Lejian Ren, Qiang Luo, Qianqian Wang, Qigen Hu, Rui Huang, Shiyao Wang, Weifeng Ding, Wuchao Li, Xinchen Luo, Xingmei Wang, Zexuan Cheng, Zixing Zhang, Bin Zhang, Boxuan Wang, Chaoyi Ma, Chengru Song, Chenhui Wang, Di Wang, Dongxue Meng, Fan Yang, Fangyu Zhang , et al. (40 additional authors not shown)

    Abstract: Recommender systems have been widely used in various large-scale user-oriented platforms for many years. However, compared to the rapid developments in the AI community, recommendation systems have not achieved a breakthrough in recent years. For instance, they still rely on a multi-stage cascaded architecture rather than an end-to-end approach, leading to computational fragmentation and optimizat…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Authors are listed alphabetically by their first name

  22. arXiv:2506.13529  [pdf, ps, other]

    cs.LG cs.AI

    Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model

    Authors: Jie Chen, Hongling Chen, Jinghuai Gao, Chuangji Meng, Tao Yang, XinXin Liang

    Abstract: Seismic acoustic impedance plays a crucial role in lithological identification and subsurface structure interpretation. However, due to the inherently ill-posed nature of the inversion problem, directly estimating impedance from post-stack seismic data remains highly challenging. Recently, diffusion models have shown great potential in addressing such inverse problems due to their strong prior lea…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  23. arXiv:2506.13444  [pdf, ps, other]

    cs.CV

    Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images

    Authors: Laiyan Ding, Hualie Jiang, Jiwei Chen, Rui Huang

    Abstract: Depth map enhancement using paired high-resolution RGB images offers a cost-effective solution for improving low-resolution depth data from lightweight ToF sensors. Nevertheless, naively adopting a depth estimation pipeline to fuse the two modalities requires groundtruth depth maps for supervision. To address this, we propose a self-supervised learning framework, SelfToF, which generates detailed…

    Submitted 17 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: accepted by IROS 2025

  24. arXiv:2506.13428  [pdf, ps, other]

    cs.RO

    VLM-SFD: VLM-Assisted Siamese Flow Diffusion Framework for Dual-Arm Cooperative Manipulation

    Authors: Jiaming Chen, Yiyu Jiang, Aoshen Huang, Yang Li, Wei Pan

    Abstract: Dual-arm cooperative manipulation holds great promise for tackling complex real-world tasks that demand seamless coordination and adaptive dynamics. Despite substantial progress in learning-based motion planning, most approaches struggle to generalize across diverse manipulation tasks and adapt to dynamic, unstructured environments, particularly in scenarios involving interactions between two obje…

    Submitted 16 June, 2025; originally announced June 2025.

  25. arXiv:2506.13376  [pdf, ps, other]

    math.RT math.AG

    ACM tilting bundles on a Geigle-Lenzing projective plane of type $(2,2,2,p)$

    Authors: Jianmin Chen, Shiquan Ruan, Weikang Weng

    Abstract: Let $\mathbb{X}$ be a Geigle-Lenzing projective plane of type $(2,2,2,p)$ and $\mathsf{coh} \mathbb{X}$ the category of coherent sheaves on $\mathbb{X}$. This paper is devoted to studying ACM tilting bundles over $\mathbb{X}$, that is, tilting objects in the derived category $\mathsf{D}^{\rm b}(\mathsf{coh} \, \mathbb{X})$ that are also ACM bundles. We show that a tilting bundle consisting of line bu…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 28 pages, comments welcome!

  26. arXiv:2506.13334  [pdf, ps, other]

    hep-ex

    Measurement of the $Ω_c^0$ and $Ξ_c^0$ baryon lifetimes using hadronic $b$-baryon decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1141 additional authors not shown)

    Abstract: The lifetimes of the $Ω_c^0$ and $Ξ_c^0$ baryons are measured using a $pp$ collision dataset collected by the LHCb experiment, corresponding to an integrated luminosity of $9~\rm{fb^{-1}}$. The charm baryons are produced in the fully reconstructed decay chains $Ω_b^- \rightarrow Ω_c^0 (\rightarrow pK^-K^-π^+)~π^-$ and $Ξ_b^- \rightarrow Ξ_c^0 (\rightarrow pK^-K^-π^+)~π^-$. The measurement uses top…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3875/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-013,CERN-EP-2025-117

  27. arXiv:2506.13245  [pdf, ps, other]

    cs.AI cs.CY cs.GT

    A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs

    Authors: Guoxi Zhang, Jiawei Chen, Tianzhuo Yang, Jiaming Ji, Yaodong Yang, Juntao Dai

    Abstract: The increasing prevalence of large language models (LLMs) is influencing global value systems. However, these models frequently exhibit a pronounced WEIRD (Western, Educated, Industrialized, Rich, Democratic) cultural bias due to lack of attention to minority values. This monocultural perspective may reinforce dominant values and marginalize diverse cultural viewpoints, posing challenges for the d…

    Submitted 16 June, 2025; originally announced June 2025.

  28. arXiv:2506.13215  [pdf, ps, other]

    cs.CV

    DVP-MVS++: Synergize Depth-Normal-Edge and Harmonized Visibility Prior for Multi-View Stereo

    Authors: Zhenlong Yuan, Dapeng Zhang, Zehao Li, Chengxuan Qian, Jianing Chen, Yinda Chen, Kehua Chen, Tianlu Mao, Zhaoxin Li, Hao Jiang, Zhaoqi Wang

    Abstract: Recently, patch deformation-based methods have demonstrated significant effectiveness in multi-view stereo due to their incorporation of deformable and expandable perception for reconstructing textureless areas. However, these methods generally focus on identifying reliable pixel correlations to mitigate matching ambiguity of patch deformation, while neglecting the deformation instability caused b…

    Submitted 16 June, 2025; originally announced June 2025.

  29. arXiv:2506.13151  [pdf, ps, other]

    cs.AR

    Reconfigurable Digital RRAM Logic Enables In-Situ Pruning and Learning for Edge AI

    Authors: Songqi Wang, Yue Zhang, Jia Chen, Xinyuan Zhang, Yi Li, Ning Lin, Yangu He, Jichang Yang, Yingjie Yu, Yi Li, Zhongrui Wang, Xiaojuan Qi, Han Wang

    Abstract: The human brain simultaneously optimizes synaptic weights and topology by growing, pruning, and strengthening synapses while performing all computation entirely in memory. In contrast, modern artificial-intelligence systems separate weight optimization from topology optimization and depend on energy-intensive von Neumann architectures. Here, we present a software-hardware co-design that bridges th…

    Submitted 16 June, 2025; originally announced June 2025.

  30. arXiv:2506.13046  [pdf, ps, other]

    cond-mat.soft

    Implementing van der Waals forces for polytope particles in DEM simulations of clay

    Authors: Dominik Krengel, Jian Chen, Zhipeng Yu, Hans-Georg Matuttis, Takashi Matsushima

    Abstract: Clay minerals are non-spherical nano-scale particles that usually form flocculated, house-of-card like structures under the influence of inter-molecular forces. Numerical modeling of clays is still in its infancy as the required inter-particle forces are available only for spherical particles. A polytope approach would allow shape-accurate forces and torques while simultaneously being more perform…

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: 4 pages, 5 figures, accepted for publication

  31. arXiv:2506.12754  [pdf, ps, other]

    cs.LG cs.AI

    AFBS: Buffer Gradient Selection in Semi-asynchronous Federated Learning

    Authors: Chaoyi Lu, Yiding Sun, Jinqian Chen, Zhichuan Yang, Jiangming Pan, Jihua Zhu

    Abstract: Asynchronous federated learning (AFL) accelerates training by eliminating the need to wait for stragglers, but its asynchronous nature introduces gradient staleness, where outdated gradients degrade performance. Existing solutions address this issue with gradient buffers, forming a semi-asynchronous framework. However, this approach struggles when buffers accumulate numerous stale gradients, as bl…

    Submitted 15 June, 2025; originally announced June 2025.

  32. arXiv:2506.12712  [pdf, ps, other]

    cs.CV eess.IV

    Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups

    Authors: Zhenghao Xi, Zhengnan Lv, Yang Zheng, Xiang Liu, Zhuang Yu, Junran Chen, Jing Hu, Yaqi Liu

    Abstract: The segmentation of coal maceral groups can be described as a semantic segmentation process of coal maceral group images, which is of great significance for studying the chemical properties of coal. Generally, existing semantic segmentation models of coal maceral groups use the method of stacking parameters to achieve higher accuracy. It leads to increased computational requirements and impacts mo…

    Submitted 15 June, 2025; originally announced June 2025.

  33. arXiv:2506.12670  [pdf, ps, other]

    math.AG math.CV

    Characterization of fiberwise bimeromorphism and specialization of bimeromorphic types I: the non-negative Kodaira dimension case

    Authors: Jian Chen, Sheng Rao, I-Hsun Tsai

    Abstract: Inspired by the recent works of M. Kontsevich--Y. Tschinkel and J. Nicaise--J. C. Ottem on specialization of birational types for smooth families (in the scheme category) and J. Kollár's work on fiberwise bimeromorphism, we focus on characterizing the fiberwise bimeromorphism and utilizing the characterization to investigate the specialization of bimeromorphic types for non-smooth families in th…

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 33 pages

    MSC Class: Primary 14D15; Secondary 32S45; 14E05; 14E08; 14D06; 14C05

  34. arXiv:2506.12481  [pdf, ps, other]

    cs.CV cs.LG cs.SD eess.AS

    Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation

    Authors: Runhao Zeng, Qi Deng, Ronghao Zhang, Shuaicheng Niu, Jian Chen, Xiping Hu, Victor C. M. Leung

    Abstract: Test-time adaptation (TTA) aims to boost the generalization capability of a trained model by conducting self-/unsupervised learning during the testing phase. While most existing TTA methods for video primarily utilize visual supervisory signals, they often overlook the potential contribution of inherent audio data. To address this gap, we propose a novel approach that incorporates audio informatio…

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 14 pages, 7 figures

  35. arXiv:2506.12430  [pdf, ps, other]

    cs.CR cs.CV

    Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

    Authors: Zonghao Ying, Siyang Wu, Run Hao, Peng Ying, Shixuan Sun, Pengyu Chen, Junze Chen, Hao Du, Kaiwen Shen, Shangkun Wu, Jiwei Wei, Shiyuan He, Yang Yang, Xiaohai Xu, Ke Ma, Qianqian Xu, Qingming Huang, Shi Lin, Xun Wang, Changting Lin, Meng Han, Yilei Jiang, Siqi Lai, Yaozhi Zheng, Yifei Song , et al. (22 additional authors not shown)

    Abstract: Multimodal Large Language Models (MLLMs) have enabled transformative advancements across diverse applications but remain susceptible to safety threats, especially jailbreak attacks that induce harmful outputs. To systematically evaluate and improve their safety, we organized the Adversarial Testing & Large-model Alignment Safety Grand Challenge (ATLAS) 2025. This technical report presents finding…

    Submitted 14 June, 2025; originally announced June 2025.

  36. arXiv:2506.12382  [pdf, ps, other]

    cs.LG cs.AI cs.CR

    Exploring the Secondary Risks of Large Language Models

    Authors: Jiawei Chen, Zhengwei Fang, Xiao Yang, Chao Yu, Zhaoxia Yin, Hang Su

    Abstract: Ensuring the safety and alignment of Large Language Models is a significant challenge with their growing integration into critical applications and societal functions. While prior research has primarily focused on jailbreak attacks, less attention has been given to non-adversarial failures that subtly emerge during benign interactions. We introduce secondary risks, a novel class of failure modes ma…

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 18 pages, 5 figures

  37. arXiv:2506.11775  [pdf, ps, other]

    cs.RO

    ExoStart: Efficient learning for dexterous manipulation with sensorized exoskeleton demonstrations

    Authors: Zilin Si, Jose Enrique Chen, M. Emre Karagozler, Antonia Bronars, Jonathan Hutchinson, Thomas Lampe, Nimrod Gileadi, Taylor Howell, Stefano Saliceti, Lukasz Barczyk, Ilan Olivarez Correa, Tom Erez, Mohit Shridhar, Murilo Fernandes Martins, Konstantinos Bousmalis, Nicolas Heess, Francesco Nori, Maria Bauza Villalonga

    Abstract: Recent advancements in teleoperation systems have enabled high-quality data collection for robotic manipulators, showing impressive results in learning manipulation at scale. This progress suggests that extending these capabilities to robotic hands could unlock an even broader range of manipulation skills, especially if we could achieve the same level of dexterity that human hands exhibit. However…

    Submitted 13 June, 2025; originally announced June 2025.

  38. arXiv:2506.11558  [pdf, ps, other]

    cs.CV cs.AI cs.CL

    DaMO: A Data-Efficient Multimodal Orchestrator for Temporal Reasoning with Video LLMs

    Authors: Bo-Cheng Chiu, Jen-Jee Chen, Yu-Chee Tseng, Feng-Chi Chen

    Abstract: Large Language Models (LLMs) have recently been extended to the video domain, enabling sophisticated video-language understanding. However, existing Video LLMs often exhibit limitations in fine-grained temporal reasoning, restricting their ability to precisely attribute responses to specific video moments, especially under constrained supervision. We introduce DaMO, a data-efficient Video LLM expl…

    Submitted 13 June, 2025; originally announced June 2025.

  39. arXiv:2506.11543  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation

    Authors: Zhuguanyu Wu, Shihe Wang, Jiayi Zhang, Jiaxin Chen, Yunhong Wang

    Abstract: Post-training quantization (PTQ) has stood out as a cost-effective and promising model compression paradigm in recent years, as it avoids computationally intensive model retraining. Nevertheless, current PTQ methods for Vision Transformers (ViTs) still suffer from significant accuracy degradation, especially under low-bit quantization. To address these shortcomings, we analyze the prevailing Hessi…

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: CVPR 2025 Highlight

  40. arXiv:2506.11473  [pdf, ps, other]

    gr-qc

    Radii of spherical timelike geodesics in Kerr-Newman black holes

    Authors: Wei Huang, Jun-Xu Chen, Jia-Hui Huang

    Abstract: The existence, radii and radial stability of the equatorial and non-equatorial (particularly, the polar) spherical orbits are discussed for particles with different conserved energy. The radii of these orbits generally are solutions of a quintic polynomial equation with four dimensionless parameters. For the case with $γ=1$, we obtain the analytical expressions for the radii of the polar, equatori…

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 15 pages, 35 figures

  41. arXiv:2506.11430  [pdf, ps, other]

    cs.CV

    Auto-Connect: Connectivity-Preserving RigFormer with Direct Preference Optimization

    Authors: Jingfeng Guo, Jian Liu, Jinnan Chen, Shiwei Mao, Changrong Hu, Puhua Jiang, Junlin Yu, Jing Xu, Qi Liu, Lixin Xu, Zhuo Chen, Chunchao Guo

    Abstract: We introduce Auto-Connect, a novel approach for automatic rigging that explicitly preserves skeletal connectivity through a connectivity-preserving tokenization scheme. Unlike previous methods that predict bone positions represented as two joints or first predict points before determining connectivity, our method employs special tokens to define endpoints for each joint's children and for each hie…

    Submitted 12 June, 2025; originally announced June 2025.

  42. arXiv:2506.11127  [pdf, ps, other]

    cs.CL cs.AI

    GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions

    Authors: Wenkang Han, Zhixiong Zeng, Jing Huang, Shu Jiang, Liming Zheng, Longrong Yang, Haibo Qiu, Chang Yao, Jingyuan Chen, Lin Ma

    Abstract: Autonomous agents for Graphical User Interfaces (GUIs) are revolutionizing human-computer interaction, yet their reliance on text-based instructions imposes limitations on accessibility and convenience, particularly in hands-free scenarios. To address this gap, we propose GUIRoboTron-Speech, the first end-to-end autonomous GUI agent that directly accepts speech instructions and on-device screensho…

    Submitted 10 June, 2025; originally announced June 2025.

  43. arXiv:2506.10963  [pdf, ps, other]

    cs.CV cs.CL

    MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning

    Authors: Yuxuan Luo, Yuhui Yuan, Junwen Chen, Haonan Cai, Ziyi Yue, Yuwei Yang, Fatima Zohra Daha, Ji Li, Zhouhui Lian

    Abstract: In this paper, we introduce knowledge image generation as a new task, alongside the Massive Multi-Discipline Multi-Tier Knowledge-Image Generation Benchmark (MMMG) to probe the reasoning capability of image generation models. Knowledge images have been central to human civilization and to the mechanisms of human learning -- a fact underscored by dual-coding theory and the picture-superiority effec…

    Submitted 13 June, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

    Comments: 85 pages, 70 figures, code: https://github.com/MMMGBench/MMMG, project page: https://mmmgbench.github.io/

  44. arXiv:2506.10954  [pdf, ps, other]

    cs.SE cs.AI

    SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

    Authors: Lianghong Guo, Yanlin Wang, Caihua Li, Pengyu Yang, Jiachi Chen, Wei Tao, Yingtian Zou, Duyu Tang, Zibin Zheng

    Abstract: Constructing large-scale datasets for the GitHub issue resolution task is crucial for both training and evaluating the software engineering capabilities of Large Language Models (LLMs). However, the traditional process for creating such benchmarks is notoriously challenging and labor-intensive, particularly in the stages of setting up evaluation environments, grading test outcomes, and validating…

    Submitted 12 June, 2025; originally announced June 2025.

  45. arXiv:2506.10826  [pdf, ps, other]

    cs.RO

    RationalVLA: A Rational Vision-Language-Action Model with Dual System

    Authors: Wenxuan Song, Jiayi Chen, Wenxue Li, Xu He, Han Zhao, Can Cui, Pengxiang Ding, Shiyan Su, Feilong Tang, Xuelian Cheng, Donglin Wang, Zongyuan Ge, Xinhu Zheng, Zhe Liu, Hesheng Wang, Haoang Li

    Abstract: A fundamental requirement for real-world robotic deployment is the ability to understand and respond to natural language instructions. Existing language-conditioned manipulation tasks typically assume that instructions are perfectly aligned with the environment. This assumption limits robustness and generalization in realistic scenarios where instructions may be ambiguous, irrelevant, or infeasibl…

    Submitted 13 June, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

    Comments: 14 pages

  46. arXiv:2506.10764  [pdf, ps, other]

    cs.AI cs.LG

    OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems

    Authors: Xiaozhe Li, Jixuan Chen, Xinyu Fang, Shengyuan Ding, Haodong Duan, Qingwen Liu, Kai Chen

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities in solving diverse tasks. However, their proficiency in iteratively optimizing complex solutions through learning from previous feedback remains insufficiently explored. To bridge this gap, we present OPT-BENCH, a comprehensive benchmark designed to evaluate LLM agents on large-scale search space optimization problems. OPT-BENCH inclu…

    Submitted 12 June, 2025; originally announced June 2025.

  47. arXiv:2506.10657  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electric field control of third-order nonlinear Hall effect

    Authors: Jiaju Yang, Lujun Wei, Yanghui Li, Lina Chen, Wei Niu, Jiarui Chen, Jun Du, Yong Pu

    Abstract: The third-order nonlinear Hall effect (NLHE) serves as a sensitive probe of energy band geometric property, providing a new paradigm for revealing the Berry curvature distribution and topological response of quantum materials. In the Weyl semimetal TaIrTe4, we report for the first time that the sign of the third-order NLHE reverses with decreasing temperature. Through scaling law analysis, we thin…

    Submitted 12 June, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

    Comments: 20 pages, 5 figures

  48. arXiv:2506.10571  [pdf, ps, other]

    quant-ph

    Unitary Scrambling and Collapse: A Quantum Diffusion Framework for Generative Modeling

    Authors: Yihua Li, Jiayi Chen, Tamanna S. Kumavat, Kyriakos Flouris

    Abstract: Quantum computing, with its promise of exponential speedups, is rapidly emerging as a powerful paradigm for advancing artificial intelligence. We propose QSC-Diffusion, the first fully quantum diffusion-based framework for image generation. Our method integrates classical Gaussian noise with quantum scrambling in the forward process, and employs parameterized quantum circuits with measurement-indu…

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Submitted to NeurIPS 2025

  49. arXiv:2506.10453  [pdf, ps, other]

    cs.CV eess.IV

    Rethinking Generative Human Video Coding with Implicit Motion Transformation

    Authors: Bolin Chen, Ru-Ling Liao, Jie Chen, Yan Ye

    Abstract: Beyond traditional hybrid-based video codec, generative video codec could achieve promising compression performance by evolving high-dimensional signals into compact feature representations for bitstream compactness at the encoder side and developing explicit motion fields as intermediate supervision for high-quality reconstruction at the decoder side. This paradigm has achieved significant succes…

    Submitted 12 June, 2025; originally announced June 2025.

  50. arXiv:2506.10439  [pdf, ps, other]

    quant-ph

    Photon-mediated interactions by Floquet photonic lattices

    Authors: Jia-Qiang Chen, Peng-Bo Li, Álvaro Gómez-León, Alejandro González-Tudela

    Abstract: We investigate the interactions between two-level emitters mediated by time-dependent, one-dimensional, structured photonic baths, focusing on Floquet topological lattices. Building on the framework of periodically driven photonic lattices, we demonstrate and characterize the emergence of tunable-range emitter's interactions mediated by bound states absent in static photonic lattices. In particula…

    Submitted 12 June, 2025; originally announced June 2025.