
Showing 51–100 of 26,487 results for author: Jiang

  1. arXiv:2506.21022  [pdf, ps, other]

    cs.CV

    Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation

    Authors: Ze Wang, Hao Chen, Benran Hu, Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad Barsoum, Zicheng Liu

    Abstract: Image tokenization plays a critical role in reducing the computational demands of modeling high-resolution images, significantly improving the efficiency of image and multimodal understanding and generation. Recent advances in 1D latent spaces have reduced the number of tokens required by eliminating the need for a 2D grid structure. In this paper, we further advance compact discrete image represe…

    Submitted 26 June, 2025; originally announced June 2025.

  2. arXiv:2506.20997  [pdf, ps, other]

    astro-ph.GA

    A Glimpse of Satellite Galaxies in the Milky Way with the 2.5-meter Wide Field Survey Telescope (WFST): Bootes III and Draco

    Authors: Chao Yang, Zhizheng Pan, Min Fang, Xian Zhong Zheng, Binyang Liu, Guoliang Li, Tian-Rui Sun, Ji-An Jiang, Miaomiao Zhang, Zhen Wan, Shuang Liu, Han Qu, Ji Yang, Xu Kong, Wenhao Liu, Yiping Shu, Jiang Chang, Tinggui Wang, Lulu Fan, Yongquan Xue, Wentao Luo, Hongxin Zhang, Zheng Lou, Haibin Zhao, Bin Li , et al. (12 additional authors not shown)

    Abstract: We carry out deep imaging of the Milky Way satellite galaxies, Bootes III and Draco, with WFST as one pilot observing program to demonstrate the capability of WFST. Combining catalogs with PS1 DR2 and Gaia DR3, we derive proper motions for candidate member stars in these two satellite galaxies over a 12-year time baseline, yielding uncertainties of ~1.8 mas/yr at 21 mag and ~3.0 mas/yr at 22 mag i…

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 17 pages, 12 figures, 3 tables. Accepted for publication in ApJ

  3. arXiv:2506.20981  [pdf, ps, other]

    cs.CR

    PrivacyGo: Privacy-Preserving Ad Measurement with Multidimensional Intersection

    Authors: Jian Du, Haohao Qian, Shikun Zhang, Wen-jie Lu, Donghang Lu, Yongchuan Niu, Bo Jiang, Yongjun Zhao, Qiang Yan

    Abstract: This paper tackles the challenging and practical problem of multi-identifier private user profile matching for privacy-preserving ad measurement, a cornerstone of modern advertising analytics. We introduce a comprehensive cryptographic framework leveraging reversed Oblivious Pseudorandom Functions (OPRF) and novel blind key rotation techniques to support secure matching across multiple identifiers…

    Submitted 25 June, 2025; originally announced June 2025.

  4. arXiv:2506.20963  [pdf, ps, other]

    cs.IR cs.LG

    EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora

    Authors: Fangyuan Zhang, Zhengjun Huang, Yingli Zhou, Qintian Guo, Zhixun Li, Wensheng Luo, Di Jiang, Yixiang Fang, Xiaofang Zhou

    Abstract: Graph-based Retrieval-Augmented Generation (Graph-RAG) enhances large language models (LLMs) by structuring retrieval over an external corpus. However, existing approaches typically assume a static corpus, requiring expensive full-graph reconstruction whenever new documents arrive, limiting their scalability in dynamic, evolving environments. To address these limitations, we introduce EraRAG, a no…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: Under review

  5. arXiv:2506.20960  [pdf, ps, other]

    cs.CV cs.AI

    OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs

    Authors: Yiman Zhang, Ziheng Luo, Qiangyu Yan, Wei He, Borui Jiang, Xinghao Chen, Kai Han

    Abstract: In this paper, we introduce OmniEval, a benchmark for evaluating omni-modality models like MiniCPM-O 2.6, which encompasses visual, auditory, and textual inputs. Compared with existing benchmarks, our OmniEval has several distinctive features: (i) Full-modal collaboration: We design evaluation tasks that highlight the strong coupling between audio and video, requiring models to effectively leverag…

    Submitted 29 June, 2025; v1 submitted 25 June, 2025; originally announced June 2025.

  6. A Framework for Building Data Structures from Communication Protocols

    Authors: Alexandr Andoni, Shunhua Jiang, Omri Weinstein

    Abstract: We present a general framework for designing efficient data structures for high-dimensional pattern-matching problems ($\exists \;? i\in[n], f(x_i,y)=1$) through communication models in which $f(x,y)$ admits sublinear communication protocols with exponentially-small error. Specifically, we reduce the data structure problem to the Unambiguous Arthur-Merlin (UAM) communication complexity of…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 53 pages, STOC 2025

  7. arXiv:2506.20558  [pdf, ps, other]

    cs.SE

    CCISolver: End-to-End Detection and Repair of Method-Level Code-Comment Inconsistency

    Authors: Renyi Zhong, Yintong Huo, Wenwei Gu, Jinxi Kuang, Zhihan Jiang, Guangba Yu, Yichen Li, David Lo, Michael R. Lyu

    Abstract: Comments within code serve as a crucial foundation for software documentation, facilitating developers to communicate and understand the code effectively. However, code-comment inconsistency (CCI) can negatively affect software development, testing, and maintenance. Recent efforts to mitigate this issue have emerged, but existing studies often suffer from inaccurate datasets and inadequate solutio…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: This manuscript is under review

  8. arXiv:2506.20532  [pdf, ps, other]

    astro-ph.HE

    {\tt RapidGBM}: An Efficient Tool for Fermi-GBM Visibility Checking and Data Analysis with a Case Study of EP240617a

    Authors: Yun Wang, Jia Ren, Lu-Yao Jiang, Hao Zhou, Yi-Han Iris Yin, Yi-Fang Liang, Zhi-Ping Jin, Yi-Zhong Fan, Da-Ming Wei, Wei Chen, Hui Sun, Jing-Wei Hu, Dong-Yue Li, Jun Yang, Wen-Da Zhang, Yuan Liu, Wei-Min Yuan, Xue-Feng Wu

    Abstract: We have developed a lightweight tool {\tt RapidGBM}, featured by a web-based interface and capabilities of rapid calculation of Fermi-GBM visibilities and performing basic data analysis. It has two key features: (1) immediately check the visibility of Fermi-GBM for new transients, and (2) check the light curve and perform spectral analysis after the hourly TTE data is released. The visibility chec…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 11 pages, 7 figures, 1 table

  9. arXiv:2506.20510  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    High-temperature helical edge states in BiSbTeSe$_2$/graphene van der Waals heterostructure

    Authors: Yoichi Tanabe, Ngoc Han Tu, Ming-Chun Jiang, Yi Ling Chiew, Mitsutaka Haruta, Kiyohiro Adachi, David Pomaranski, Ryo Ito, Yuya Shimazaki, Daisuke Hashizume, Xiuzhen Yu, Guang-Yu Guo, Ryotaro Arita, Michihisa Yamamoto

    Abstract: Van der Waals heterostructures have been used to tailor atomic layers into various artificial materials through interactions at heterointerfaces. The interplay between the band gap created by the band folding of the interfacial potential and the band inversion driven by enhanced spin-orbit interaction (SOI) through band hybridization enables us to realize a two-dimensional topological insulator (2…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 31 pages, 4 figures, and 7 supporting figures

  10. arXiv:2506.20502  [pdf]

    astro-ph.SR physics.space-ph

    Probing Solar Polar Regions

    Authors: Yuanyong Deng, Hui Tian, Jie Jiang, Shuhong Yang, Hao Li, Robert Cameron, Laurent Gizon, Louise Harra, Robert F. Wimmer-Schweingruber, Frédéric Auchère, Xianyong Bai, Luis Bellot Rubio, Linjie Chen, Pengfei Chen, Lakshmi Pradeep Chitta, Jackie Davies, Fabio Favata, Li Feng, Xueshang Feng, Weiqun Gan, Don Hassler, Jiansen He, Junfeng Hou, Zhenyong Hou, Chunlan Jin , et al. (23 additional authors not shown)

    Abstract: The magnetic fields and dynamical processes in the solar polar regions play a crucial role in the solar magnetic cycle and in supplying mass and energy to the fast solar wind, ultimately being vital in controlling solar activities and driving space weather. Despite numerous efforts to explore these regions, to date no imaging observations of the Sun's poles have been achieved from vantage points o…

    Submitted 28 June, 2025; v1 submitted 25 June, 2025; originally announced June 2025.

    Comments: Accepted for publication in Chinese Journal of Space Science

  11. arXiv:2506.20479  [pdf, ps, other]

    astro-ph.GA

    The MALATANG survey: Dense gas distribution on sub-kiloparsec scales across the disk of M82

    Authors: Jian-Fa Wang, Yu Gao, Qing-Hua Tan, Xue-Jian Jiang, Li Ji, Zhi-Yu Zhang, Jun-Zhi Wang, Jun-Feng Wang, R. Thomas Greve, Yan Jiang, Ashley Bemis, Elias Brinks, Aeree Chung, J. Malcolm Currie, Richard de Grijs, Taotao Fang, C. Luis Ho, Bumhyun Lee, Satoki Matsushita, Michał Michałowski, Soojong Pak, Panomporn Poojon, G. Mark Rawlings, Amelie Saintonge, Yi-Chen Sun , et al. (1 additional author not shown)

    Abstract: We present observations of HCN J=4-3 and HCO^+ J=4-3 lines obtained with the James Clerk Maxwell Telescope as part of the MALATANG survey, combined with archival HCN J=1-0 and HCO^+ J=1-0 data from the Green Bank Telescope, to study the spatial distribution and excitation conditions of dense molecular gas in the disk of M82. We detect HCN J=4-3 and HCO^+ J=4-3 emission within the central region (<…

    Submitted 26 June, 2025; v1 submitted 25 June, 2025; originally announced June 2025.

  12. arXiv:2506.20414  [pdf, ps, other]

    nucl-th

    Shell effects in nuclear charge radii based on Skyrme density functionals

    Authors: Rong An, Shuai Sun, Xiang Jiang, Na Tang, Li-Gang Cao, Feng-Shou Zhang

    Abstract: A unified description of the charge radii throughout the entire nuclide chart plays an essential role for our understanding of nuclear structure and fundamental nuclear interactions. In this work, the influence of new term, which catches the spirit of neutron and proton pairs condensation around Fermi surface, on the charge radii has been investigated based on the Skyrme density functionals with t…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 29 pages, 4 figures, 1 table. To appear in Physical Review C

  13. arXiv:2506.20392  [pdf]

    cond-mat.str-el

    Transport Evidence for Wigner Crystals in Monolayer MoTe2

    Authors: Mingjie Zhang, Zhenyu Wang, Yifan Jiang, Yaotian Liu, Kenji Watanabe, Takashi Taniguchi, Song Liu, Shiming Lei, Yongqing Li, Yang Xu

    Abstract: The crystallization of charge carriers, dubbed the Wigner crystal, is anticipated at low densities in clean two-dimensional electronic systems (2DES). While there has been extensive investigation across diverse platforms, probing spontaneous charge and spin ordering is hindered by disorder effects and limited interaction energies. Here, we report transport evidence for Wigner crystals with antifer…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 25 pages, 4 figures and 8 supplemental figures

  14. arXiv:2506.20344  [pdf, ps, other]

    math.OC cs.LG

    A Complete Loss Landscape Analysis of Regularized Deep Matrix Factorization

    Authors: Po Chen, Rujun Jiang, Peng Wang

    Abstract: Despite its wide range of applications across various domains, the optimization foundations of deep matrix factorization (DMF) remain largely open. In this work, we aim to fill this gap by conducting a comprehensive study of the loss landscape of the regularized DMF problem. Toward this goal, we first provide a closed-form expression of all critical points. Building on this, we establish precise c…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 35 pages, 3 figures

  15. arXiv:2506.20332  [pdf, ps, other]

    cs.AI

    Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards

    Authors: Jihao Gu, Qihang Ai, Yingyao Wang, Pi Bu, Jingxuan Xing, Zekun Zhu, Wei Jiang, Ziming Wang, Yingxiu Zhao, Ming-Liang Zhang, Jun Song, Yuning Jiang, Bo Zheng

    Abstract: Vision-language model-based mobile agents have gained the ability to not only understand complex instructions and mobile screenshots, but also optimize their action outputs via thinking and reasoning, benefiting from reinforcement learning, such as Group Relative Policy Optimization (GRPO). However, existing research centers on offline reinforcement learning training or online optimization using a…

    Submitted 27 June, 2025; v1 submitted 25 June, 2025; originally announced June 2025.

    Comments: 14 pages, 12 figures

  16. arXiv:2506.20265  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.supr-con

    Two-dimensional transition metal selenides family M2Se: A platform for superconductivity, band topology, and charge density waves

    Authors: Shu-Xiang Qiao, Kai-Yue Jiang, Yu-Lin Han, Na Jiao, Ying-Jie Chen, Hong-Yan Lu, Ping Zhang

    Abstract: MXenes and MBenes, which are two-dimensional (2D) transition metal carbides/nitrides and borides, have been extensively studied for their impressive properties. Recently, we reported a family of transition metal sulfides MSene (M2S) with rich properties [Phys. Rev. B 111, L041404 (2025)], it is worth studying whether selenides with similar structure also have rich properties. In this work, through…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 10 pages, 5 figures

  17. arXiv:2506.20263  [pdf, ps, other]

    cs.CV

    Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image Classification

    Authors: Ning Luo, Meiyin Hu, Huan Wan, Yanyan Yang, Zhuohang Jiang, Xin Wei

    Abstract: Few-shot fine-grained image classification (FS-FGIC) presents a significant challenge, requiring models to distinguish visually similar subclasses with limited labeled examples. Existing methods have critical limitations: metric-based methods lose spatial information and misalign local features, while reconstruction-based methods fail to utilize hierarchical feature information and lack mechanisms…

    Submitted 25 June, 2025; originally announced June 2025.

  18. arXiv:2506.20260  [pdf, ps, other]

    cs.LG cs.AI cs.MA

    Argumentative Ensembling for Robust Recourse under Model Multiplicity

    Authors: Junqi Jiang, Antonio Rago, Francesco Leofante, Francesca Toni

    Abstract: In machine learning, it is common to obtain multiple equally performing models for the same prediction task, e.g., when training neural networks with different random seeds. Model multiplicity (MM) is the situation which arises when these competing models differ in their predictions for the same input, for which ensembling is often employed to determine an aggregation of the outputs. Providing rec…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.15097

  19. arXiv:2506.20200  [pdf, ps, other]

    eess.IV cs.CV

    MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment

    Authors: Siqiao Li, Chen Hui, Wei Zhang, Rui Liang, Chenyue Song, Feng Jiang, Haiqi Zhu, Zhixuan Li, Hong Huang, Xiang Li

    Abstract: Positron Emission Tomography / Computed Tomography (PET/CT) plays a critical role in medical imaging, combining functional and anatomical information to aid in accurate diagnosis. However, image quality degradation due to noise, compression and other factors could potentially lead to diagnostic uncertainty and increase the risk of misdiagnosis. When evaluating the quality of a PET/CT image, both l…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: Accepted to MICCAI 2025

  20. arXiv:2506.20109  [pdf, ps, other]

    cs.CR

    Evaluating Disassembly Errors With Only Binaries

    Authors: Lambang Akbar Wijayadi, Yuancheng Jiang, Roland H. C. Yap, Zhenkai Liang, Zhuohao Liu

    Abstract: Disassemblers are crucial in the analysis and modification of binaries. Existing works showing disassembler errors largely rely on practical implementation without specific guarantees and assume source code and compiler toolchains to evaluate ground truth. However, the assumption of source code is contrary to typical binary scenarios where only the binary is available. In this work, we investigate…

    Submitted 24 June, 2025; originally announced June 2025.

  21. arXiv:2506.20103  [pdf, ps, other]

    cs.CV cs.AI

    BrokenVideos: A Benchmark Dataset for Fine-Grained Artifact Localization in AI-Generated Videos

    Authors: Jiahao Lin, Weixuan Peng, Bojia Zi, Yifeng Gao, Xianbiao Qi, Xingjun Ma, Yu-Gang Jiang

    Abstract: Recent advances in deep generative models have led to significant progress in video generation, yet the fidelity of AI-generated videos remains limited. Synthesized content often exhibits visual artifacts such as temporally inconsistent motion, physically implausible trajectories, unnatural object deformations, and local blurring that undermine realism and user trust. Accurate detection and spatia…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 7 pages, 4 figures, 2 tables

    ACM Class: I.4

  22. arXiv:2506.20095  [pdf, ps, other]

    hep-th

    Solving Infinite Families of Dual Conformal Integrals and Periods

    Authors: Song He, Xuhang Jiang

    Abstract: We compute infinite families of all-loop planar, dual conformal invariant (DCI) integrals, which contribute to four-point Coulomb-branch amplitudes and correlators in ${\cal N}=4$ supersymmetric Yang-Mills theory, by solving ``boxing" differential equations via package HyperlogProcedures; this amounts to an ``inverse-boxing" operation/integration recursively acting on lower-loop cases (with the bo…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 44 pages, many figures

  23. arXiv:2506.19884  [pdf, ps, other]

    cs.OS cs.AI cs.PF cs.SE

    MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection

    Authors: Zhengxiang Huang, Chaoyue Niu, Zhaode Wang, Jiarui Xue, Hanming Zhang, Yugang Wang, Zewei Xin, Xiaotang Jiang, Chengfei Lv, Fan Wu, Guihai Chen

    Abstract: As the demand for on-device Large Language Model (LLM) inference grows, energy efficiency has become a major concern, especially for battery-limited mobile devices. Our analysis shows that the memory-bound LLM decode phase dominates energy use, and yet most existing works focus on accelerating the prefill phase, neglecting energy concerns. We introduce Adaptive Energy-Centric Core Selection (AECS)…

    Submitted 24 June, 2025; originally announced June 2025.

  24. arXiv:2506.19874  [pdf, ps, other]

    cs.CR cs.AI

    Towards Provable (In)Secure Model Weight Release Schemes

    Authors: Xin Yang, Bintao Tang, Yuhao Wang, Zimo Ji, Terry Jingchen Zhang, Wenyuan Jiang

    Abstract: Recent secure weight release schemes claim to enable open-source model distribution while protecting model ownership and preventing misuse. However, these approaches lack rigorous security foundations and provide only informal security guarantees. Inspired by established works in cryptography, we formalize the security of weight release schemes by introducing several concrete security definitions.…

    Submitted 26 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 8 pages, 2 figures; author name typos and institutions corrected

  25. arXiv:2506.19707  [pdf, ps, other]

    quant-ph

    Enhanced Image Recognition Using Gaussian Boson Sampling

    Authors: Si-Qiu Gong, Ming-Cheng Chen, Hua-Liang Liu, Hao Su, Yi-Chao Gu, Hao-Yang Tang, Meng-Hao Jia, Yu-Hao Deng, Qian Wei, Hui Wang, Han-Sen Zhong, Xiao Jiang, Li Li, Nai-Le Liu, Chao-Yang Lu, Jian-Wei Pan

    Abstract: Gaussian boson sampling (GBS) has emerged as a promising quantum computing paradigm, demonstrating its potential in various applications. However, most existing works focus on theoretical aspects or simple tasks, with limited exploration of its capabilities in solving real-world practical problems. In this work, we propose a novel GBS-based image recognition scheme inspired by extreme learning mac…

    Submitted 24 June, 2025; originally announced June 2025.

  26. arXiv:2506.19694  [pdf, ps, other]

    cs.CV

    UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation

    Authors: Yue Zhou, Yuan Bi, Wenjuan Tong, Wei Wang, Nassir Navab, Zhongliang Jiang

    Abstract: Precise anomaly detection in medical images is critical for clinical decision-making. While recent unsupervised or semi-supervised anomaly detection methods trained on large-scale normal data show promising results, they lack fine-grained differentiation, such as benign vs. malignant tumors. Additionally, ultrasound (US) imaging is highly sensitive to devices and acquisition parameter variations,…

    Submitted 24 June, 2025; originally announced June 2025.

  27. arXiv:2506.19683  [pdf, ps, other]

    cs.CV cs.AI cs.LG eess.IV

    Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance

    Authors: Xuesong Li, Dianye Huang, Yameng Zhang, Nassir Navab, Zhongliang Jiang

    Abstract: Understanding medical ultrasound imaging remains a long-standing challenge due to significant visual variability caused by differences in imaging and acquisition parameters. Recent advancements in large language models (LLMs) have been used to automatically generate terminology-rich summaries orientated to clinicians with sufficient physiological knowledge. Nevertheless, the increasing demand for…

    Submitted 26 June, 2025; v1 submitted 24 June, 2025; originally announced June 2025.

  28. arXiv:2506.19681  [pdf, ps, other]

    cs.CV

    Genome-Anchored Foundation Model Embeddings Improve Molecular Prediction from Histology Images

    Authors: Cheng Jin, Fengtao Zhou, Yunfang Yu, Jiabo Ma, Yihui Wang, Yingxue Xu, Huajun Zhou, Hao Jiang, Luyang Luo, Luhui Mao, Zifan He, Xiuming Zhang, Jing Zhang, Ronald Chan, Herui Yao, Hao Chen

    Abstract: Precision oncology requires accurate molecular insights, yet obtaining these directly from genomics is costly and time-consuming for broad clinical use. Predicting complex molecular features and patient prognosis directly from routine whole-slide images (WSI) remains a major challenge for current deep learning methods. Here we introduce PathLUPI, which uses transcriptomic privileged information du…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: Under Review

  29. arXiv:2506.19643  [pdf, ps, other]

    cs.LG

    Unsupervised Data Generation for Offline Reinforcement Learning: A Perspective from Model

    Authors: Shuncheng He, Hongchang Zhang, Jianzhun Shao, Yuhang Jiang, Xiangyang Ji

    Abstract: Offline reinforcement learning (RL) recently gains growing interests from RL researchers. However, the performance of offline RL suffers from the out-of-distribution problem, which can be corrected by feedback in online RL. Previous offline RL research focuses on restricting the offline algorithm in in-distribution even in-sample action sampling. In contrast, fewer work pays attention to the influ…

    Submitted 24 June, 2025; originally announced June 2025.

  30. arXiv:2506.19500  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling

    Authors: Yan Jiang, Hao Zhou, LiZhong GU, Ai Han, TianLong Li

    Abstract: LLMs' reliance on static knowledge and fragile tool invocation severely hinders the orchestration of complex, heterogeneous toolchains, particularly at large scales. Existing methods typically use rigid single-path execution, resulting in poor error recovery and exponentially growing search spaces. We introduce NaviAgent, a graph-navigated bilevel planning architecture for robust function calling,…

    Submitted 24 June, 2025; originally announced June 2025.

  31. arXiv:2506.19488  [pdf, ps, other]

    cs.CV

    SceneCrafter: Controllable Multi-View Driving Scene Editing

    Authors: Zehao Zhu, Yuliang Zou, Chiyu Max Jiang, Bo Sun, Vincent Casser, Xiukun Huang, Jiahao Wang, Zhenpei Yang, Ruiqi Gao, Leonidas Guibas, Mingxing Tan, Dragomir Anguelov

    Abstract: Simulation is crucial for developing and evaluating autonomous vehicle (AV) systems. Recent literature builds on a new generation of generative models to synthesize highly realistic images for full-stack simulation. However, purely synthetically generated scenes are not grounded in reality and have difficulty in inspiring confidence in the relevance of its outcomes. Editing models, on the other ha…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: CVPR 2025

  32. arXiv:2506.19449  [pdf, ps, other]

    physics.optics

    A broadband platform to search for hidden photons

    Authors: Daqing Liu, Bin Tang, Xingfang Jiang, Xianyun Liu, Ning Ma

    Abstract: The optical behavior of a structure consisting of graphene sheets embedded in media was studied, and the differences between the structure and ordinary birefringent crystal, double zero-reflectance point, were identified. We showed the changes in the optical behavior of the structure due to the existence of hidden photons. When a radiation illuminates the structure, only…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 8 pages, 5 figures

  33. arXiv:2506.19425  [pdf, ps, other]

    cs.SE

    What Makes the Best Decomposition? Investigating Binary Decomposition Under FCG Variance

    Authors: Ang Jia, He Jiang, Zhilei Ren, Xiaochen Li, Ming Fan, Ting Liu

    Abstract: Binary decomposition, which decomposes binary files into modules, plays a critical role in binary reuse detection. Existing binary decomposition works either apply anchor-based methods by extending anchor functions to generate modules, or apply clustering-based methods by using clustering algorithms to group binary functions, which all rely on that reused code shares similar function call relation…

    Submitted 24 June, 2025; originally announced June 2025.

  34. arXiv:2506.19368  [pdf, ps, other]

    cs.CR

    Yotta: A Large-Scale Trustless Data Trading Scheme for Blockchain System

    Authors: Xiang Liu, Zhanpeng Guo, Liangxi Liu, Mengyao Zheng, Yiming Qiu, Linshan Jiang

    Abstract: Data trading is one of the key focuses of Web 3.0. However, all the current methods that rely on blockchain-based smart contracts for data exchange cannot support large-scale data trading while ensuring data security, which falls short of fulfilling the spirit of Web 3.0. Even worse, there is currently a lack of discussion on the essential properties that large-scale data trading should satisfy. I…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 9 pages, 2 figures, Exploratory Paper

    Journal ref: Nanyang Blockchain Conference 2025

  35. arXiv:2506.19296  [pdf, ps, other]

    cs.LG

    The Effect of Depth on the Expressivity of Deep Linear State-Space Models

    Authors: Zeyu Bao, Penghao Yu, Haotian Jiang, Qianxiao Li

    Abstract: Deep state-space models (SSMs) have gained increasing popularity in sequence modelling. While there are numerous theoretical investigations of shallow SSMs, how the depth of the SSM affects its expressiveness remains a crucial problem. In this paper, we systematically investigate the role of depth and width in deep linear SSMs, aiming to characterize how they influence the expressive capacity of t…

    Submitted 24 June, 2025; originally announced June 2025.

  36. arXiv:2506.19257  [pdf, ps, other]

    cs.CV cs.CL

    MSR-Align: Policy-Grounded Multimodal Alignment for Safety-Aware Reasoning in Vision-Language Models

    Authors: Yinan Xia, Yilei Jiang, Yingshui Tan, Xiaoyong Zhu, Xiangyu Yue, Bo Zheng

    Abstract: Vision-Language Models (VLMs) have achieved remarkable progress in multimodal reasoning tasks through enhanced chain-of-thought capabilities. However, this advancement also introduces novel safety risks, as these models become increasingly vulnerable to harmful multimodal prompts that can trigger unethical or unsafe behaviors. Existing safety alignment approaches, primarily designed for unimodal l…

    Submitted 23 June, 2025; originally announced June 2025.

  37. arXiv:2506.19205  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Deposition-Dependent Coverage and Performance of Phosphonic Acid Interface Modifiers in Halide Perovskite Optoelectronics

    Authors: Hannah Contreras, Aidan O'Brien, Margherita Taddei, Yangwei Shi, Fangyuan Jiang, Robert J. E. Westbrook, Yadong Zhang, Rajiv Giridharagopal, Paul A. Lee, Stephen Barlow, Seth R. Marder, Neal R. Armstrong, David S. Ginger

    Abstract: In this work, we study the effect of various deposition methods for phosphonic acid interface modifiers commonly pursued as self-assembled monolayers in high-performance metal halide perovskite photovoltaics and light-emitting diodes. We compare the deposition of (2-(3,6-diiodo-9H-carbazol-9-yl)ethyl)phosphonic acid onto indium tin oxide (ITO) bottom contacts by varying three parameters: the metho…

    Submitted 23 June, 2025; originally announced June 2025.

  38. arXiv:2506.19180  [pdf, ps, other]

    hep-ex hep-ph

    Precise Measurement of the $Λ$ Electric Dipole Moment through the Entangled Strange Baryon-Antibaryon System

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: The dominance of matter over antimatter in the universe has consistently driven the pursuit of new physics beyond the Standard Model that violates charge-parity symmetry. Unlike the well-constrained electrons and neutrons, strange baryons (hyperons) remain a largely unexplored territory, in which interactions between hyperons and particles from new physics could induce a non-trivial electric dipol…

    Submitted 28 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

  39. arXiv:2506.18898  [pdf, ps, other]

    cs.CV cs.AI cs.CL cs.MM

    Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

    Authors: Jiaming Han, Hao Chen, Yang Zhao, Hanyu Wang, Qi Zhao, Ziyan Yang, Hao He, Xiangyu Yue, Lu Jiang

    Abstract: This paper presents a multimodal framework that attempts to unify visual understanding and generation within a shared discrete semantic representation. At its core is the Text-Aligned Tokenizer (TA-Tok), which converts images into discrete tokens using a text-aligned codebook projected from a large language model's (LLM) vocabulary. By integrating vision and text into a unified space with an expan…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Project page: https://tar.csuhan.com

  40. arXiv:2506.18871  [pdf, ps, other]

    cs.CV cs.AI cs.CL

    OmniGen2: Exploration to Advanced Multimodal Generation

    Authors: Chenyuan Wu, Pengfei Zheng, Ruiran Yan, Shitao Xiao, Xin Luo, Yueze Wang, Wanli Li, Xiyan Jiang, Yexin Liu, Junjie Zhou, Ze Liu, Ziyi Xia, Chaofan Li, Haoge Deng, Jiahao Wang, Kun Luo, Bo Zhang, Defu Lian, Xinlong Wang, Zhongyuan Wang, Tiejun Huang, Zheng Liu

    Abstract: In this work, we introduce OmniGen2, a versatile and open-source generative model designed to provide a unified solution for diverse generation tasks, including text-to-image, image editing, and in-context generation. Unlike OmniGen v1, OmniGen2 features two distinct decoding pathways for text and image modalities, utilizing unshared parameters and a decoupled image tokenizer. This design enables…

    Submitted 25 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

  41. arXiv:2506.18856  [pdf, ps, other

    cs.CV

    RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

    Authors: Kuanning Wang, Yuqian Fu, Tianyu Wang, Yanwei Fu, Longfei Liang, Yu-Gang Jiang, Xiangyang Xue

    Abstract: Accurate 6D pose estimation is key for robotic manipulation, enabling precise object localization for tasks like grasping. We present RAG-6DPose, a retrieval-augmented approach that leverages 3D CAD models as a knowledge base by integrating both visual and geometric cues. Our RAG-6DPose roughly contains three stages: 1) Building a Multi-Modal CAD Knowledge Base by extracting 2D visual features fro…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Accepted by IROS 2025

  42. arXiv:2506.18786  [pdf, ps, other

    cs.HC

    Flow-Aware Diffusion for Real-Time VR Restoration: Enhancing Spatiotemporal Coherence and Efficiency

    Authors: Yitong Zhu, Guanxuan Jiang, Zhuowen Liang, Yuyang Wang

    Abstract: Cybersickness remains a critical barrier to the widespread adoption of Virtual Reality (VR), particularly in scenarios involving intense or artificial motion cues. Among the key contributors is excessive optical flow: perceived visual motion that, when unmatched by vestibular input, leads to sensory conflict and discomfort. While previous efforts have explored geometric or hardware-based mitigation…

    Submitted 23 June, 2025; originally announced June 2025.

  43. arXiv:2506.18734  [pdf, ps, other

    quant-ph

    Can boundary configuration be tuned to optimize directional quantum steering harvesting?

    Authors: Xiao-Li Huang, Xiao-Ying Jiang, Yu-Xuan Wang, Si-Yu Liu, Zejun Wang, Shu-Min Wu

    Abstract: We investigate the harvesting of quantum steering and its asymmetry between two static detectors locally interacting with a vacuum massless scalar field near an infinite, perfectly reflecting boundary. The detectors are arranged either parallel or orthogonal to the boundary, with detector $B$ assumed to have an energy gap greater than or equal to that of detector $A$. It is interesting to observe…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 20 pages, 7 figures

  44. arXiv:2506.18701  [pdf, ps, other

    cs.CV cs.AI

    Matrix-Game: Interactive World Foundation Model

    Authors: Yifan Zhang, Chunli Peng, Boyang Wang, Puyi Wang, Qingcheng Zhu, Fei Kang, Biao Jiang, Zedong Gao, Eric Li, Yang Liu, Yahui Zhou

    Abstract: We introduce Matrix-Game, an interactive world foundation model for controllable game world generation. Matrix-Game is trained using a two-stage pipeline that first performs large-scale unlabeled pretraining for environment understanding, followed by action-labeled training for interactive video generation. To support this, we curate Matrix-Game-MC, a comprehensive Minecraft dataset comprising ove…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Technical Report

  45. arXiv:2506.18636  [pdf

    physics.optics

    A comparative analysis of plasmonic and dielectric metasurface sensing platforms powered by bound states in the continuum

    Authors: Tao Jiang, Angana Bhattacharya, Martin Barkey, Andreas Aigner, Thomas Weber, Juan Wang, Stefan A. Maier, Andreas Tittl

    Abstract: Nanophotonic platforms based on surface-enhanced infrared absorbance spectroscopy (SEIRAS) have emerged as an effective tool for molecular detection. Sensitive nanophotonic sensors with robust resonant modes and amplified electromagnetic near fields are essential for spectroscopy, especially in lossy environments. Metasurfaces driven by bound states in the continuum (BICs) have unlocked a powerful…

    Submitted 23 June, 2025; originally announced June 2025.

  46. arXiv:2506.18606  [pdf, ps, other

    hep-ph hep-ex hep-lat

    A hybrid nonet with $J^{PC}=1^{-+}$ or a tetraquark 81-plet

    Authors: Niu Su, Er-Liang Cui, Yi-Wei Jiang, Hua-Xing Chen

    Abstract: Confirming the existence of hybrid states remains challenging due to their experimental indistinguishability from tightly bound tetraquarks and loosely bound molecules. To address this issue, we employ QCD sum rules to systematically investigate the $π_1(1600)$ and $η_1(1855)$ as candidate tetraquark states with exotic quantum numbers $J^{PC} = 1^{-+}$. Within the hybrid framework, an $SU(3)$ flav…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 6 pages, 3 figures, suggestions and comments welcome

  47. arXiv:2506.18506  [pdf

    physics.ins-det quant-ph

    Detection of subsurface structures with a vehicle-based atom gravity gradiometer

    Authors: Xiaowei Zhang, Jiaqi Zhong, Muyan Wang, Huilin Wan, Hui Xiong, Dandan Jiang, Zhi Li, Dekai Mao, Bin Gao, Biao Tang, Xi Chen, Jin Wang, Mingsheng Zhan

    Abstract: High-precision mobile gravity gradiometers are very useful in geodesy and geophysics. Atom gravity gradiometers (AGGs) could be among the most accurate mobile gravity gradiometers but are currently constrained by the trade-off between portability and sensitivity. Here, we present a high-sensitivity mobile AGG featuring an ultra-compact sensor head with a volume of only 94 L. In the laboratory, it…

    Submitted 25 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 13 pages, 8 figures

  48. arXiv:2506.18478  [pdf, ps, other

    stat.AP

    High-Dimensional Multi-Study Robust Factor Model for Analyzing RNA Sequencing Data from Heterogeneous Sources

    Authors: Xiaolu Jiang, Wei Liu

    Abstract: The amount of high-dimensional large-scale RNA sequencing data derived from multiple heterogeneous sources has increased exponentially in biological science. During data collection, significant technical noise or errors may occur. To robustly extract meaningful features from this type of data, we introduce a high-dimensional multi-study robust factor model, called MultiRFM, which learns latent fea…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 36 pages, 4 figures

  49. arXiv:2506.18476  [pdf, ps, other

    cs.CV

    Context Consistency Learning via Sentence Removal for Semi-Supervised Video Paragraph Grounding

    Authors: Yaokun Zhong, Siyu Jiang, Jian Zhu, Jian-Fang Hu

    Abstract: Semi-Supervised Video Paragraph Grounding (SSVPG) aims to localize multiple sentences in a paragraph from an untrimmed video with limited temporal annotations. Existing methods focus on teacher-student consistency learning and video-level contrastive loss, but they overlook the importance of perturbing query contexts to generate strong supervisory signals. In this work, we propose a novel Context…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Accepted by ICME 2025

  50. arXiv:2506.18420  [pdf, ps, other

    math.AP

    Incompressible Euler limit from the Boltzmann equation with Maxwell reflection boundary condition in the half-space

    Authors: Ning Jiang, Chao Wang, Yulong Wu, Zhifei Zhang

    Abstract: In this paper, we rigorously justify the incompressible Euler limit of the Boltzmann equation with general Maxwell reflection boundary condition in the half-space. The accommodation coefficient $α\in (0,1]$ is assumed to be $O(1)$. Our construction of solutions includes the interior fluid part and Knudsen-Prandtl coupled boundary layers. The corresponding solutions to the nonlinear Euler and nonli…

    Submitted 23 June, 2025; originally announced June 2025.

    MSC Class: 35B25; 35F20; 35Q20; 76N15; 82C40