Skip to main content

Showing 101–150 of 26,487 results for author: Jiang

.
  1. arXiv:2506.18374  [pdf, ps, other

    math.PR math.NA

    Probabilistic approximation of fully nonlinear second-order PIDEs with convergence rates for the universal robust limit theorem

    Authors: Lianzi Jiang, Mingshang Hu, Gechun Liang

    Abstract: This paper develops a probabilistic approximation scheme for a class of nonstandard, fully nonlinear second-order partial integro-differential equations (PIDEs) arising from nonlinear Lévy processes under Peng's G-expectation framework. The PIDE involves a supremum over a set of \(α\)-stable Lévy measures, potentially with degenerate diffusion and a non-separable uncertainty set, which renders exi… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 26 pages

    MSC Class: 60F05; 65M15; 60H30; 45K05

  2. arXiv:2506.18292  [pdf

    cs.CV

    Rapeseed population point cloud completion network (RP-PCN) with dynamic graph convolution for 3D reconstruction of crop canopy occlusion architecture

    Authors: Ziyue Guo, Xin Yang, Yutao Shen, Yang Zhu, Lixi Jiang, Haiyan Cen

    Abstract: Quantitative descriptions of complete canopy architecture are crucial for evaluating crop photosynthesis and yield to guide ideotype design. Although three-dimensional (3D) sensing technologies have been developed for plant and canopy reconstruction, severe occlusion and complex architectures hinder accurate canopy descriptions. In this study, we propose a point cloud completion model for 3D recon… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  3. arXiv:2506.18268  [pdf, ps, other

    cs.CV

    ThermalLoc: A Vision Transformer-Based Approach for Robust Thermal Camera Relocalization in Large-Scale Environments

    Authors: Yu Liu, Yangtao Meng, Xianfei Pan, Jie Jiang, Changhao Chen

    Abstract: Thermal cameras capture environmental data through heat emission, a fundamentally different mechanism compared to visible light cameras, which rely on pinhole imaging. As a result, traditional visual relocalization methods designed for visible light images are not directly applicable to thermal images. Despite significant advancements in deep learning for camera relocalization, approaches specific… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 8 pages, 3 figures, accepted to IROS 2025

  4. arXiv:2506.18256  [pdf, ps, other

    cs.RO

    Robot Tactile Gesture Recognition Based on Full-body Modular E-skin

    Authors: Shuo Jiang, Boce Hu, Linfeng Zhao, Lawson L. S. Wong

    Abstract: With the development of robot electronic skin technology, various tactile sensors, enhanced by AI, are unlocking a new dimension of perception for robots. In this work, we explore how robots equipped with electronic skin can recognize tactile gestures and interpret them as human commands. We developed a modular robot E-skin, composed of multiple irregularly shaped skin patches, which can be assemb… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  5. arXiv:2506.18246  [pdf, ps, other

    cs.CV

    Referring Expression Instance Retrieval and A Strong End-to-End Baseline

    Authors: Xiangzhao Hao, Kuan Zhu, Hongyu Guo, Haiyun Guo, Ning Jiang, Quan Lu, Ming Tang, Jinqiao Wang

    Abstract: Using natural language to query visual information is a fundamental need in real-world applications. Text-Image Retrieval (TIR) retrieves a target image from a gallery based on an image-level description, while Referring Expression Comprehension (REC) localizes a target object within a given image using an instance-level description. However, real-world applications often present more complex dema… ▽ More

    Submitted 26 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

  6. CT Radiomics-Based Explainable Machine Learning Model for Accurate Differentiation of Malignant and Benign Endometrial Tumors: A Two-Center Study

    Authors: Tingrui Zhang, Honglin Wu, Zekun Jiang, Yingying Wang, Rui Ye, Huiming Ni, Chang Liu, Jin Cao, Xuan Sun, Rong Shao, Xiaorong Wei, Yingchun Sun

    Abstract: Aimed to develop and validate a CT radiomics-based explainable machine learning model for diagnosing malignancy and benignity specifically in endometrial cancer (EC) patients. A total of 83 EC patients from two centers, including 46 with malignant and 37 with benign conditions, were included, with data split into a training set (n=59) and a testing set (n=24). The regions of interest (ROIs) were m… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 30 pages, 5 figures, 3 tables

  7. arXiv:2506.18102  [pdf, ps, other

    cs.CL

    InspireDebate: Multi-Dimensional Subjective-Objective Evaluation-Guided Reasoning and Optimization for Debating

    Authors: Fuyu Wang, Jiangtong Li, Kun Zhu, Changjun Jiang

    Abstract: With the rapid advancements in large language models (LLMs), debating tasks, such as argument quality assessment and debate process simulation, have made significant progress. However, existing LLM-based debating systems focus on responding to specific arguments while neglecting objective assessments such as authenticity and logical validity. Furthermore, these systems lack a structured approach t… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 20 pages; Accepted to ACL 2025 Main

  8. arXiv:2506.18034  [pdf, ps, other

    cs.CV cs.AI cs.MM

    Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster

    Authors: Fenghe Tang, Wenxin Ma, Zhiyang He, Xiaodong Tao, Zihang Jiang, S. Kevin Zhou

    Abstract: With the advancement of Large Language Model (LLM) for natural language processing, this paper presents an intriguing finding: a frozen pre-trained LLM layer can process visual tokens for medical image segmentation tasks. Specifically, we propose a simple hybrid structure that integrates a pre-trained, frozen LLM layer within the CNN encoder-decoder segmentation framework (LLM4Seg). Surprisingly,… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted by MICCAI 2025. Code: https://github.com/FengheTan9/LLM4Seg

  9. arXiv:2506.17983  [pdf, ps, other

    eess.IV cs.CV

    LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images

    Authors: Chenyue Song, Chen Hui, Qing Lin, Wei Zhang, Siqiao Li, Haiqi Zhu, Zhixuan Li, Shengping Zhang, Shaohui Liu, Feng Jiang, Xiang Li

    Abstract: Autoregressive Initial Bits is a framework that integrates sub-image autoregression and latent variable modeling, demonstrating its advantages in lossless medical image compression. However, in existing methods, the image segmentation process leads to an even distribution of latent variable information across each sub-image, which in turn causes posterior collapse and inefficient utilization of la… ▽ More

    Submitted 25 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted to MICCAI 2025

  10. arXiv:2506.17969  [pdf, ps, other

    cs.CV

    BPCLIP: A Bottom-up Image Quality Assessment from Distortion to Semantics Based on CLIP

    Authors: Chenyue Song, Chen Hui, Wei Zhang, Haiqi Zhu, Shaohui Liu, Hong Huang, Feng Jiang

    Abstract: Image Quality Assessment (IQA) aims to evaluate the perceptual quality of images based on human subjective perception. Existing methods generally combine multiscale features to achieve high performance, but most rely on straightforward linear fusion of these features, which may not adequately capture the impact of distortions on semantic content. To address this, we propose a bottom-up image quali… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted to ICME 2025

  11. arXiv:2506.17963  [pdf, ps, other

    q-bio.BM cs.AI

    OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learning

    Authors: Zhiwei Nie, Hongyu Zhang, Hao Jiang, Yutian Liu, Xiansong Huang, Fan Xu, Jie Fu, Zhixiang Ren, Yonghong Tian, Wen-Bin Zhang, Jie Chen

    Abstract: Understanding and modeling enzyme-substrate interactions is crucial for catalytic mechanism research, enzyme engineering, and metabolic engineering. Although a large number of predictive methods have emerged, they do not incorporate prior knowledge of enzyme catalysis to rationally modulate general protein-molecule features that are misaligned with catalytic patterns. To address this issue, we int… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  12. arXiv:2506.17887  [pdf, ps, other

    eess.SP

    Near-Field Propagation and Spatial Non-Stationarity Channel Model for 6-24 GHz (FR3) Extremely Large-Scale MIMO: Adopted by 3GPP for 6G

    Authors: Huixin Xu, Jianhua Zhang, Pan Tang, Hongbo Xing, Haiyang Miao, Nan Zhang, Jian Li, Jianming Wu, Wenfei Yang, Zhening Zhang, Wei Jiang, Zijian He, Afshin Haghighat, Qixing Wang, Guangyi Liu

    Abstract: Next generation cellular deployments are expected to exploit the 6-24 GHz frequency range 3 (FR3) and extremely large-scale multiple-input multiple-output (XL-MIMO) to enable ultra-high data rates and reliability. However, the significantly enlarged antenna apertures and higher carrier frequencies render the far-field and spatial stationarity assumptions in the existing 3rd generation partnership… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  13. arXiv:2506.17787  [pdf, ps, other

    cs.CV

    Incorporating Rather Than Eliminating: Achieving Fairness for Skin Disease Diagnosis Through Group-Specific Expert

    Authors: Gelei Xu, Yuying Duan, Zheyuan Liu, Xueyang Li, Meng Jiang, Michael Lemmon, Wei Jin, Yiyu Shi

    Abstract: AI-based systems have achieved high accuracy in skin disease diagnostics but often exhibit biases across demographic groups, leading to inequitable healthcare outcomes and diminished patient trust. Most existing bias mitigation methods attempt to eliminate the correlation between sensitive attributes and diagnostic prediction, but those methods often degrade performance due to the lost of clinical… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 11 pages, 2 figures

  14. arXiv:2506.17730  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Room-temperature intrinsic nonlinear planar Hall effect in TaIrTe$_4$

    Authors: Chang Jiang, Fan Yang, Jinshan Yang, Peng Yu, Huiying Liu, Yuda Zhang, Zehao Jia, Xiangyu Cao, Jingyi Yan, Zheng Liu, Xian-Lei Sheng, Cong Xiao, Shengyuan A. Yang, Shaoming Dong, Faxian Xiu

    Abstract: Intrinsic responses are of paramount importance in physics research, as they represent the inherent properties of materials, independent of extrinsic factors that vary from sample to sample, and often reveal the intriguing quantum geometry of the band structure. Here, we report the experimental discovery of a new intrinsic response in charge transport, specifically the intrinsic nonlinear planar H… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  15. arXiv:2506.17661  [pdf, ps, other

    nucl-ex astro-ph.HE

    New Determination of the $^{14}$C(n, $γ$)$^{15}$C Reaction Rate and Its Astrophysical Implications

    Authors: Yuchen Jiang, Zhenyu He, Yudong Luo, Wenyu Xin, Jie Chen, Xinyue Li, Yangping Shen, Bing Guo, Guo Li, Danyang Pang, Tianli Ma, Weike Nan, Toshitaka Kajino, Weiping Liu

    Abstract: We present a novel experiment to investigate the spectroscopic factor of the $^{15}$C ground state for the first time using single-neutron $removal$ transfer reactions on $^{15}$C. Two consistent spectroscopic factors were derived from the (p, d) and (d, t) reactions, which were subsequently used to deduce the $^{14}$C(n, $γ$)$^{15}$C reaction cross section and the corresponding stellar reaction r… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 20 pages, 8 figures, accepted by "The Astrophysical Journal"

  16. arXiv:2506.17647  [pdf, ps, other

    cs.SE

    Improving Compiler Bug Isolation by Leveraging Large Language Models

    Authors: Yixian Qi, Jiajun Jiang, Fengjie Li, Bowen Chen, Hongyu Zhang, Junjie Chen

    Abstract: Compilers play a foundational role in building reliable software systems, and bugs within them can lead to catastrophic consequences. The compilation process typically involves hundreds of files, making traditional automated bug isolation techniques inapplicable due to scalability or effectiveness issues. Current mainstream compiler bug localization techniques have limitations in test program muta… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 12 pages, 7 figures

  17. arXiv:2506.17644  [pdf, ps, other

    cs.AI

    Measuring and Augmenting Large Language Models for Solving Capture-the-Flag Challenges

    Authors: Zimo Ji, Daoyuan Wu, Wenyuan Jiang, Pingchuan Ma, Zongjie Li, Shuai Wang

    Abstract: Capture-the-Flag (CTF) competitions are crucial for cybersecurity education and training. As large language models (LLMs) evolve, there is increasing interest in their ability to automate CTF challenge solving. For example, DARPA has organized the AIxCC competition since 2023 to advance AI-powered automated offense and defense. However, this demands a combination of multiple abilities, from knowle… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  18. arXiv:2506.17589  [pdf, ps, other

    cs.AI

    Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown

    Authors: Bowen Wang, Zhouqiang Jiang, Yasuaki Susumu, Shotaro Miwa, Tianwei Chen, Yuta Nakashima

    Abstract: The real value of knowledge lies not just in its accumulation, but in its potential to be harnessed effectively to conquer the unknown. Although recent multimodal large language models (MLLMs) exhibit impressing multimodal capabilities, they often fail in rarely encountered domain-specific tasks due to limited relevant knowledge. To explore this, we adopt visual game cognition as a testbed and sel… ▽ More

    Submitted 25 June, 2025; v1 submitted 21 June, 2025; originally announced June 2025.

    Comments: Accepted by ICCV 2025

  19. arXiv:2506.17561  [pdf, ps, other

    cs.CV cs.AI cs.RO

    VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models

    Authors: Chongkai Gao, Zixuan Liu, Zhenghao Chi, Junshan Huang, Xin Fei, Yiwen Hou, Yuxuan Zhang, Yudi Lin, Zhirui Fang, Zeyu Jiang, Lin Shao

    Abstract: Recent studies on Vision-Language-Action (VLA) models have shifted from the end-to-end action-generation paradigm toward a pipeline involving task planning followed by action generation, demonstrating improved performance on various complex, long-horizon manipulation tasks. However, existing approaches vary significantly in terms of network architectures, planning paradigms, representations, and t… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  20. arXiv:2506.17545  [pdf, ps, other

    cs.CV

    Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations

    Authors: Zhihao Yuan, Shuyi Jiang, Chun-Mei Feng, Yaolun Zhang, Shuguang Cui, Zhen Li, Na Zhao

    Abstract: Currently, utilizing large language models to understand the 3D world is becoming popular. Yet existing 3D-aware LLMs act as black boxes: they output bounding boxes or textual answers without revealing how those decisions are made, and they still rely on pre-trained 3D detectors to supply object proposals. We introduce Scene-R1, a video-grounded framework that learns to reason about 3D scenes with… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  21. arXiv:2506.17536  [pdf

    physics.med-ph cs.AI

    Exploring Strategies for Personalized Radiation Therapy Part I Unlocking Response-Related Tumor Subregions with Class Activation Mapping

    Authors: Hao Peng, Steve Jiang, Robert Timmerman

    Abstract: Personalized precision radiation therapy requires more than simple classification, it demands the identification of prognostic, spatially informative features and the ability to adapt treatment based on individual response. This study compares three approaches for predicting treatment response: standard radiomics, gradient based features, and convolutional neural networks enhanced with Class Activ… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  22. arXiv:2506.17491  [pdf

    physics.med-ph cs.AI

    Exploring Strategies for Personalized Radiation Therapy Part II Predicting Tumor Drift Patterns with Diffusion Models

    Authors: Hao Peng, Steve Jiang, Robert Timmerman

    Abstract: Radiation therapy outcomes are decided by two key parameters, dose and timing, whose best values vary substantially across patients. This variability is especially critical in the treatment of brain cancer, where fractionated or staged stereotactic radiosurgery improves safety compared to single fraction approaches, but complicates the ability to predict treatment response. To address this challen… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  23. arXiv:2506.17417  [pdf, ps, other

    cs.LG

    Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling?

    Authors: Mingyuan Wu, Meitang Li, Jingcheng Yang, Jize Jiang, Kaizhuo Yan, Zhaoheng Li, Minjia Zhang, Klara Nahrstedt

    Abstract: Recent advances in large language models (LLMs) have demonstrated that inference-time computation techniques, such as decoding-time scaling and self-refinement, can significantly enhance reasoning capabilities without relying on external knowledge. A key driver of this success is the emergence of self-correction and self-verification behaviors, often elicited through reinforcement learning (RL). I… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Work in progress

  24. arXiv:2506.17286  [pdf, ps, other

    cs.CL cs.AI

    GTA: Grouped-head latenT Attention

    Authors: Luoyang Sun, Jiwen Jiang, Cheng Deng, Xinjian Wu, Haifeng Zhang, Lei Chen, Lionel Ni, Jun Wang

    Abstract: Attention mechanisms underpin the success of large language models (LLMs), yet their substantial computational and memory overhead poses challenges for optimizing efficiency and performance. A critical bottleneck arises as KV cache and attention computations scale rapidly with text length, challenging deployment on hardware with limited computational and memory resources. We observe that attention… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  25. arXiv:2506.17272  [pdf, ps, other

    cs.IR cs.AI

    QUST_NLP at SemEval-2025 Task 7: A Three-Stage Retrieval Framework for Monolingual and Crosslingual Fact-Checked Claim Retrieval

    Authors: Youzheng Liu, Jiyan Liu, Xiaoman Xu, Taihang Wang, Yimin Wang, Ye Jiang

    Abstract: This paper describes the participation of QUST_NLP in the SemEval-2025 Task 7. We propose a three-stage retrieval framework specifically designed for fact-checked claim retrieval. Initially, we evaluate the performance of several retrieval models and select the one that yields the best results for candidate retrieval. Next, we employ multiple re-ranking models to enhance the candidate results, wit… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  26. Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model

    Authors: Side Liu, Jiang Ming, Guodong Zhou, Xinyi Liu, Jianming Fu, Guojun Peng

    Abstract: Malicious PDF files have emerged as a persistent threat and become a popular attack vector in web-based attacks. While machine learning-based PDF malware classifiers have shown promise, these classifiers are often susceptible to adversarial attacks, undermining their reliability. To address this issue, recent studies have aimed to enhance the robustness of PDF classifiers. Despite these efforts, t… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Accepted by ACM CCS 2025

  27. arXiv:2506.17125  [pdf, ps, other

    cs.SE

    Large Language Model Unlearning for Source Code

    Authors: Xue Jiang, Yihong Dong, Zheng Fang, Yingwei Ma, Tangxinyu Wang, Rongyu Cao, Binhua Li, Zhi Jin, Wenpin Jiao, Yongbin Li, Ge Li

    Abstract: LLM4SE has demonstrated significant success, but LLMs' potential memorization of sensitive or outdated training data introduces critical risks to legal compliance, software security, and code quality. LLM unlearning techniques, which can eliminate the influence of undesired data from LLMs in a post-training way, present a promising solution to address these concerns. While recent efforts in LLM un… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  28. arXiv:2506.16962  [pdf, ps, other

    cs.CV cs.AI cs.CL

    Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs

    Authors: Haoran Sun, Yankai Jiang, Wenjie Lou, Yujie Zhang, Wenjie Li, Lilong Wang, Mianxin Liu, Lei Liu, Xiaosong Wang

    Abstract: Multimodal large language models (MLLMs) have begun to demonstrate robust reasoning capabilities on general tasks, yet their application in the medical domain remains in its early stages. Constructing chain-of-thought (CoT) training data is essential for bolstering the reasoning abilities of medical MLLMs. However, existing approaches exhibit a deficiency in offering a comprehensive framework for… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  29. arXiv:2506.16859  [pdf

    physics.optics

    RS-Coded Adaptive Dynamic Network for Reliable Long-Term Information Transmission in Disturbed Multimode Fiber

    Authors: Yang Hu, Minyu Fan, Kun Liu, Songsong Zhu, Nan Jiang, Sha Wang

    Abstract: Multimode fiber (MMF), due to its large core diameter and high mode capacity, holds potential in high-speed communications. However, inherent modal dispersion causes output speckle distortion, and transmission characteristics are sensitive to environmental disturbances, limiting its reliable application. Conventional transmission matrix (TM) methods face challenges such as complex calibration and… ▽ More

    Submitted 23 June, 2025; v1 submitted 20 June, 2025; originally announced June 2025.

  30. arXiv:2506.16819  [pdf, ps, other

    cs.CV cs.AI

    Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection

    Authors: Yuchu Jiang, Jiaming Chu, Jian Zhao, Xin Zhang, Xu Yang, Lei Jin, Chi Zhang, Xuelong Li

    Abstract: The proliferation of generative models has raised serious concerns about visual content forgery. Existing deepfake detection methods primarily target either image-level classification or pixel-wise localization. While some achieve high accuracy, they often suffer from limited generalization across manipulation types or rely on complex architectures. In this paper, we propose Loupe, a lightweight y… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 6 pages, 2 figures, accepted by IJCAI 2025 workshop

  31. arXiv:2506.16760  [pdf, ps, other

    cs.CL cs.CV

    Cross-Modal Obfuscation for Jailbreak Attacks on Large Vision-Language Models

    Authors: Lei Jiang, Zixun Zhang, Zizhou Wang, Xiaobing Sun, Zhen Li, Liangli Zhen, Xiaohua Xu

    Abstract: Large Vision-Language Models (LVLMs) demonstrate exceptional performance across multimodal tasks, yet remain vulnerable to jailbreak attacks that bypass built-in safety mechanisms to elicit restricted content generation. Existing black-box jailbreak methods primarily rely on adversarial textual prompts or image perturbations, yet these approaches are highly detectable by standard content filtering… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 15 pages, 9 figures

  32. arXiv:2506.16749  [pdf, ps, other

    cond-mat.mtrl-sci

    Giant Magneto-Optical Effects in Two-Dimensional Flat-Band Antiferromagnets

    Authors: Ping Yang, Wanxiang Feng, Siyuan Liu, Shan Guan, Liwei Wen, Wei Jiang, Gui-Bin Liu, Yugui Yao

    Abstract: In this work, we reveal giant magneto-optical responses in two-dimensional(2D) antiferromagnets with nearly flat electronic bands, based on first-principles calculations and group-theoretical analysis. We identify a record-large second-order magneto-optical Schafer-Hubert(SH) effect, featuring a polarization rotation angle of 28 degree, in monolayer antiferromagnetic RuOCl2, driven by flatband-enh… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 6 pages, 3 figures

  33. arXiv:2506.16677  [pdf, ps, other

    cs.HC cs.RO

    PPTP: Performance-Guided Physiological Signal-Based Trust Prediction in Human-Robot Collaboration

    Authors: Hao Guo, Wei Fan, Shaohui Liu, Feng Jiang, Chunzhi Yi

    Abstract: Trust prediction is a key issue in human-robot collaboration, especially in construction scenarios where maintaining appropriate trust calibration is critical for safety and efficiency. This paper introduces the Performance-guided Physiological signal-based Trust Prediction (PPTP), a novel framework designed to improve trust assessment. We designed a human-robot construction scenario with three di… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  34. arXiv:2506.16642  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Observation of a spin-textured nematic Kondo lattice

    Authors: Yu-Xiao Jiang, Zi-Jia Cheng, Qiaozhi Xu, Md Shafayat Hossain, Xian P. Yang, Jia-Xin Yin, Maksim Litskevich, Tyler A. Cochran, Byunghoon Kim, Eduardo Miranda, Sheng Ran, Rafael M. Fernandes, M. Zahid Hasan

    Abstract: The Kondo lattice mode, as one of the most fundamental models in condensed matter physics, has been employed to describe a wide range of quantum materials such as heavy fermions, transition metal dichalcogenides and two-dimensional Moire systems. Discovering new phases on Kondo lattice and unveiling their mechanisms are crucial to the understanding of strongly correlated systems. Here, in a layere… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  35. arXiv:2506.16504  [pdf, ps, other

    cs.CV cs.AI

    Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details

    Authors: Zeqiang Lai, Yunfei Zhao, Haolin Liu, Zibo Zhao, Qingxiang Lin, Huiwen Shi, Xianghui Yang, Mingxin Yang, Shuhui Yang, Yifei Feng, Sheng Zhang, Xin Huang, Di Luo, Fan Yang, Fang Yang, Lifu Wang, Sicong Liu, Yixuan Tang, Yulin Cai, Zebin He, Tian Liu, Yuhong Liu, Jie Jiang, Linus, Jingwei Huang , et al. (1 additional authors not shown)

    Abstract: In this report, we present Hunyuan3D 2.5, a robust suite of 3D diffusion models aimed at generating high-fidelity and detailed textured 3D assets. Hunyuan3D 2.5 follows two-stages pipeline of its previous version Hunyuan3D 2.0, while demonstrating substantial advancements in both shape and texture generation. In terms of shape generation, we introduce a new shape foundation model -- LATTICE, which… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: Technical report

  36. arXiv:2506.16481  [pdf, ps, other

    astro-ph.EP

    SO emission in the dynamically perturbed protoplanetary disks around CQ Tau and MWC 758

    Authors: Francesco Zagaria, Haochang Jiang, Gianni Cataldi, Stefano Facchini, Myriam Benisty, Yuri Aikawa, Sean Andrews, Jaehan Bae, Marcelo Barraza-Alfaro, Pietro Curone, Ian Czekala, Daniele Fasano, Cassandra Hall, Iain Hammond, Jane Huang, John D. Ilee, Andrés F. Izquierdo, Jensen Lawrence, Giuseppe Lodato, François Ménard, Christophe Pinte, Giovanni P. Rosotti, Jochen Stadler, Richard Teague, Leonardo Testi , et al. (3 additional authors not shown)

    Abstract: We report the serendipitous detection of the SO $J_N=6_5-5_4$ (219.949 GHz) rotational transition in archival Atacama Large Millimeter/submillimeter Array (ALMA) observations of the spiral hosting protoplanetary disks around CQ Tau (with $\approx4.9σ$ significance) and MWC 758 (with $\approx3.4σ$ significance). In the former, the SO emission comes in the shape of a ring, arises from the edge of th… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: Accepted for publication in ApJ. 23 pages 7 figures

  37. arXiv:2506.16417  [pdf, ps, other

    quant-ph cond-mat.other physics.optics

    Collision-assisted information scrambling on a configurable photonic chip

    Authors: Xiao-Wen Shang, Shu-Yi Liang, Guan-Ju Yan, Xin-Yang Jiang, Zi-Ming Yin, Hao Tang, Jian-Peng Dou, Ze-Kun Jiang, Yu-Quan Peng, Xian-Min Jin

    Abstract: Quantum interference and entanglement are in the core of quantum computations. The fast spread of information in the quantum circuit helps to mitigate the circuit depth. Although the information scrambling in the closed systems has been proposed and tested in the digital circuits, how to measure the evolution of quantum correlations between systems and environments remains a delicate and open ques… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 7 pages, 4 figures. Comments are welcome!

  38. arXiv:2506.16306  [pdf, ps, other

    astro-ph.EP

    Investigating Transit Timing Variations in the Ultra-short Period Exoplanet WASP-19b

    Authors: Shraddha Biswas, Ing-Guey Jiang, Li-Chin Yeh, Hsin-Min Liu, Kaviya Parthasarathy, Devesh P. Sariya, D. Bisht, Mohit Singh Bisht, A. Raj

    Abstract: In this study, we present a comprehensive analysis of transit timing variations (TTVs) in the ultra-short-period gas giant WASP-19b, which orbits a G-type main-sequence star. Our analysis is based on a dataset comprising 204 transit light curves obtained from the Transiting Exoplanet Survey Satellite (TESS), the Exoplanet Transit Database (ETD), and the ExoClock project, supplemented by 18 publicl… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 35 pages, accepted for publication in AJ

  39. arXiv:2506.16304  [pdf, ps, other

    eess.SP

    A Tractable Approach to Massive Communication and Ubiquitous Connectivity in 6G Standardization

    Authors: Junyi Jiang, Wei Chen, Xin Guo, Shenghui Song, Ying Jun, Zhang, Zhu Han, Merouane Debbah, Khaled B. Letaief

    Abstract: The full-scale 6G standardization has attracted considerable recent attention, especially since the first 3GPP-wide 6G workshop held in March 2025. To understand the practical and fundamental values of 6G and facilitate its standardization, it is crucial to explore the theoretical limits of spectrum, energy, and coverage efficiency considering practical hardware and signaling constraints. In this… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  40. arXiv:2506.16273  [pdf, ps, other

    cs.CV cs.MM

    Fine-grained Image Retrieval via Dual-Vision Adaptation

    Authors: Xin Jiang, Meiqi Cao, Hao Tang, Fei Shen, Zechao Li

    Abstract: Fine-Grained Image Retrieval~(FGIR) faces challenges in learning discriminative visual representations to retrieve images with similar fine-grained features. Current leading FGIR solutions typically follow two regimes: enforce pairwise similarity constraints in the semantic embedding space, or incorporate a localization sub-network to fine-tune the entire model. However, such two regimes tend to o… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  41. arXiv:2506.16263  [pdf, ps, other

    cs.RO cs.AI

    CapsDT: Diffusion-Transformer for Capsule Robot Manipulation

    Authors: Xiting He, Mingwu Su, Xinqi Jiang, Long Bai, Jiewen Lai, Hongliang Ren

    Abstract: Vision-Language-Action (VLA) models have emerged as a prominent research area, showcasing significant potential across a variety of applications. However, their performance in endoscopy robotics, particularly endoscopy capsule robots that perform actions within the digestive system, remains unexplored. The integration of VLA models into endoscopy robots allows more intuitive and efficient interact… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: IROS 2025

  42. arXiv:2506.16199  [pdf

    cs.HC

    Development of a persuasive User Experience Research (UXR) Point of View for Explainable Artificial Intelligence (XAI)

    Authors: Mohammad Naiseh, Huseyin Dogan, Stephen Giff, Nan Jiang

    Abstract: Explainable Artificial Intelligence (XAI) plays a critical role in fostering user trust and understanding in AI-driven systems. However, the design of effective XAI interfaces presents significant challenges, particularly for UX professionals who may lack technical expertise in AI or machine learning. Existing explanation methods, such as SHAP, LIME, and counterfactual explanations, often rely on… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  43. arXiv:2506.16173  [pdf, ps, other

    cs.RO cs.SD eess.AS

    Single-Microphone-Based Sound Source Localization for Mobile Robots in Reverberant Environments

    Authors: Jiang Wang, Runwu Shi, Benjamin Yen, He Kong, Kazuhiro Nakadai

    Abstract: Accurately estimating sound source positions is crucial for robot audition. However, existing sound source localization methods typically rely on a microphone array with at least two spatially preconfigured microphones. This requirement hinders the applicability of microphone-based robot audition systems and technologies. To alleviate these challenges, we propose an online sound source localizatio… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: This paper was accepted and going to appear in the 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  44. arXiv:2506.16101  [pdf, ps, other

    cs.SE

    Regression Testing Optimization for ROS-based Autonomous Systems: A Comprehensive Review of Techniques

    Authors: Yupeng Jiang, Shuaiyi Sun, Xi Zheng

    Abstract: Regression testing plays a critical role in maintaining software reliability, particularly for ROS-based autonomous systems (ROSAS), which frequently undergo continuous integration and iterative development. However, conventional regression testing techniques face significant challenges when applied to autonomous systems due to their dynamic and non-deterministic behaviors, complex multi-modal sen… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  45. arXiv:2506.16012  [pdf, ps, other

    cs.RO

    DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning

    Authors: Boyu Li, Siyuan He, Hang Xu, Haoqi Yuan, Yu Zang, Liwei Hu, Junpeng Yue, Zhenxiong Jiang, Pengbo Hu, Börje F. Karlsson, Yehui Tang, Zongqing Lu

    Abstract: Developing embodied agents capable of performing complex interactive tasks in real-world scenarios remains a fundamental challenge in embodied AI. Although recent advances in simulation platforms have greatly enhanced task diversity to train embodied Vision Language Models (VLMs), most platforms rely on simplified robot morphologies and bypass the stochastic nature of low-level execution, which li… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  46. arXiv:2506.15980  [pdf, ps, other

    cs.CV cs.AI

    Advanced Sign Language Video Generation with Compressed and Quantized Multi-Condition Tokenization

    Authors: Cong Wang, Zexuan Deng, Zhiwei Jiang, Fei Shen, Yafeng Yin, Shiwei Gan, Zifeng Cheng, Shiping Ge, Qing Gu

    Abstract: Sign Language Video Generation (SLVG) seeks to generate identity-preserving sign language videos from spoken language texts. Existing methods primarily rely on the single coarse condition (\eg, skeleton sequences) as the intermediary to bridge the translation model and the video generation model, which limits both the naturalness and expressiveness of the generated videos. To overcome these limita… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  47. arXiv:2506.15898  [pdf, ps, other

    cs.LG

    TrajDiff: Diffusion Bridge Network with Semantic Alignment for Trajectory Similarity Computation

    Authors: Xiao Zhang, Xingyu Zhao, Hong Xia, Yuan Cao, Guiyuan Jiang, Junyu Dong, Yanwei Yu

    Abstract: With the proliferation of location-tracking technologies, massive volumes of trajectory data are continuously being collected. As a fundamental task in trajectory data mining, trajectory similarity computation plays a critical role in a wide range of real-world applications. However, existing learning-based methods face three challenges: First, they ignore the semantic gap between GPS and grid fea… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  48. arXiv:2506.15741  [pdf, ps, other

    cs.AI cs.CL

    OAgents: An Empirical Study of Building Effective Agents

    Authors: He Zhu, Tianrui Qin, King Zhu, Heyuan Huang, Yeyi Guan, Jinxiang Xia, Yi Yao, Hanhao Li, Ningning Wang, Pai Liu, Tianhao Peng, Xin Gui, Xiaowan Li, Yuhui Liu, Yuchen Eleanor Jiang, Jun Wang, Changwang Zhang, Xiangru Tang, Ge Zhang, Jian Yang, Minghao Liu, Xitong Gao, Jiaheng Liu, Wangchunshu Zhou

    Abstract: Recently, Agentic AI has become an increasingly popular research field. However, we argue that current agent research practices lack standardization and scientific rigor, making it hard to conduct fair comparisons among methods. As a result, it is still unclear how different design choices in agent frameworks affect effectiveness, and measuring their progress remains challenging. In this work, we… ▽ More

    Submitted 23 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: 28 pages

  49. arXiv:2506.15724  [pdf, ps, other

    cs.LG cs.AI cs.CL

    MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference

    Authors: Kunxi Li, Zhonghua Jiang, Zhouzhou Shen, Zhaode Wang, Chengfei Lv, Shengyu Zhang, Fan Wu, Fei Wu

    Abstract: This paper introduces MadaKV, a modality-adaptive key-value (KV) cache eviction strategy designed to enhance the efficiency of multimodal large language models (MLLMs) in long-context inference. In multimodal scenarios, attention heads exhibit varying preferences for different modalities, resulting in significant disparities in modality importance across attention heads. Traditional KV cache evict… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  50. arXiv:2506.15712  [pdf, ps, other

    cs.LG cs.AI

    BatteryBERT for Realistic Battery Fault Detection Using Point-Masked Signal Modeling

    Authors: Songqi Zhou, Ruixue Liu, Yixing Wang, Jia Lu, Benben Jiang

    Abstract: Accurate fault detection in lithium-ion batteries is essential for the safe and reliable operation of electric vehicles and energy storage systems. However, existing methods often struggle to capture complex temporal dependencies and cannot fully leverage abundant unlabeled data. Although large language models (LLMs) exhibit strong representation capabilities, their architectures are not directly… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.