Showing 1–50 of 360 results for author: Shi, K

  1. arXiv:2507.01316  [pdf, ps, other]

    physics.optics

    POST: Photonic Swin Transformer for Automated and Efficient Prediction of PCSEL

    Authors: Qi Xin, Hai Huang, Chenyu Li, Kewei Shi, Zhaoyu Zhang

    Abstract: This work designs a model named POST based on the Vision Transformer (ViT) approach. Across single, double, and even triple lattices, as well as various non-circular complex hole structures, POST enables prediction of multiple optical properties of photonic crystal layers in Photonic Crystal Surface Emitting Lasers (PCSELs) with high speed and accuracy, without requiring manual intervention, which…

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2506.15803  [pdf, ps, other]

    physics.med-ph cs.AI

    Unsupervised deep learning model for fast energy layer pre-selection of delivery-efficient proton arc therapy plan optimization of nasopharyngeal carcinoma

    Authors: Bohan Yang, Gang Liu, Rirao Dao, Yujia Qian, Ke Shi, Anke Tang, Yong Luo, Jingnan Liu

    Abstract: Objective. Proton arc therapy (PAT) is an emerging and promising modality in radiotherapy, offering several advantages over conventional intensity-modulated proton therapy (IMPT). However, identifying the optimal energy layer (EL) sequence remains computationally intensive due to the large number of possible energy layer transitions. This study proposes an unsupervised deep learning framework for f…

    Submitted 18 June, 2025; originally announced June 2025.

  3. arXiv:2506.13460  [pdf, ps, other]

    physics.med-ph

    First Positronium Lifetime Imaging with Scandium-44 on a Long Axial Field-of-view PET/CT

    Authors: Lorenzo Mercolli, William M. Steinberger, Pascal V. Grundler, Anzhelika Moiseeva, Saverio Braccini, Maurizio Conti, Paweł Moskal, Narendra Rathod, Axel Rominger, Hasan Sari, Roger Schibli, Robert Seifert, Kuangyu Shi, Ewa Ł. Stępień, Nicholas P. van der Meulen

    Abstract: Purpose: 44Sc has been successfully produced, synthesized, labeled and first-in-human studies were conducted some years ago. The decay properties of 44Sc, together with being close to a clinical implementation, make it an ideal candidate for in vivo positronium lifetime measurements. In this study, we investigate the count statistics for ortho-positronium (oPs) measurements with 44Sc. Method: A…

    Submitted 17 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.12710  [pdf, ps, other]

    cs.RO

    Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems

    Authors: Yuqi Ping, Tianhao Liang, Huahao Ding, Guangyu Lei, Junwei Wu, Xuan Zou, Kuan Shi, Rui Shao, Chiya Zhang, Weizheng Zhang, Weijie Yuan, Tingting Zhang

    Abstract: Recent breakthroughs in multimodal large language models (MLLMs) have endowed AI systems with unified perception, reasoning and natural-language interaction across text, image and video streams. Meanwhile, Unmanned Aerial Vehicle (UAV) swarms are increasingly deployed in dynamic, safety-critical missions that demand rapid situational understanding and autonomous adaptation. This paper explores pot…

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 8 pages, 5 figures, submitted to IEEE WCM

  5. arXiv:2506.08795  [pdf, other]

    cs.RO cs.AI

    Towards Biosignals-Free Autonomous Prosthetic Hand Control via Imitation Learning

    Authors: Kaijie Shi, Wanglong Lu, Hanli Zhao, Vinicius Prado da Fonseca, Ting Zou, Xianta Jiang

    Abstract: Limb loss affects millions globally, impairing physical function and reducing quality of life. Most traditional surface electromyographic (sEMG) and semi-autonomous methods require users to generate myoelectric signals for each control, imposing physically and mentally taxing demands. This study aims to develop a fully autonomous control system that enables a prosthetic hand to automatically grasp…

    Submitted 10 June, 2025; originally announced June 2025.

  6. arXiv:2506.07230  [pdf, ps, other]

    physics.med-ph physics.ins-det

    First positronium imaging using $^{44}$Sc with the J-PET scanner: a case study on the NEMA-Image Quality phantom

    Authors: Manish Das, Sushil Sharma, Aleksander Bilewicz, Jarosław Choiński, Neha Chug, Catalina Curceanu, Eryk Czerwiński, Jakub Hajduga, Sharareh Jalali, Krzysztof Kacprzak, Tevfik Kaplanoglu, Łukasz Kapłon, Kamila Kasperska, Aleksander Khreptak, Grzegorz Korcyl, Tomasz Kozik, Karol Kubat, Deepak Kumar, Anoop Kunimmal Venadan, Edward Lisowski, Filip Lisowski, Justyna Medrala-Sowa, Simbarashe Moyo, Wiktor Mryka, Szymon Niedźwiecki, et al. (19 additional authors not shown)

    Abstract: Positronium Lifetime Imaging (PLI), an emerging extension of conventional positron emission tomography (PET) imaging, offers a novel window for probing the submolecular properties of biological tissues by imaging the mean lifetime of the positronium atom. Currently, the method is under rapid development in terms of reconstruction and detection systems. Recently, the first in vivo PLI of the human…

    Submitted 8 June, 2025; originally announced June 2025.

  7. arXiv:2505.23461  [pdf, ps, other]

    cs.CL

    UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions

    Authors: Chuanyuan Tan, Wenbiao Shao, Hao Xiong, Tong Zhu, Zhenhua Liu, Kai Shi, Wenliang Chen

    Abstract: Handling unanswerable questions (UAQ) is crucial for LLMs, as it helps prevent misleading responses in complex situations. While previous studies have built several datasets to assess LLMs' performance on UAQ, these datasets lack factual knowledge support, which limits the evaluation of LLMs' ability to utilize their factual knowledge when handling UAQ. To address the limitation, we introduce a ne…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: ACL 2025 Findings

  8. arXiv:2505.17825  [pdf, ps, other]

    math.PR

    Perfect Matchings on Doubly Free Boundary Rail-Yard Graph with Macdonald Weights

    Authors: Zhongyang Li, Kaili Shi

    Abstract: We investigate the asymptotic behavior of perfect matchings on rail-yard graphs with doubly free boundary conditions and Jack weights. While a special case of this model reduces to the half space Macdonald process with Jack weights introduced by Barraquand, Borodin, and Corwin [3], the asymptotic behavior in the general Jack-weighted free boundary setting considered here has, to our knowledge, rem…

    Submitted 25 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: The paper shared definitions with arXiv:2304.00650 and arXiv:2110.11393

  9. arXiv:2505.11010  [pdf, ps, other]

    cs.CL cs.AI

    ReviewInstruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models

    Authors: Jiangxu Wu, Cong Wang, TianHuang Su, Jun Yang, Haozhi Lin, Chao Zhang, Ming Peng, Kai Shi, SongPan Yang, BinQing Pan, ZiXian Li, Ni Yang, ZhenYu Yang

    Abstract: The effectiveness of large language models (LLMs) in conversational AI is hindered by their reliance on single-turn supervised fine-tuning (SFT) data, which limits contextual coherence in multi-turn dialogues. Existing methods for generating multi-turn dialogue data struggle to ensure both diversity and quality in instructions. To address this, we propose Review-Instruct, a novel framework that sy…

    Submitted 4 July, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

    Comments: ACL2025 Accepted

  10. arXiv:2505.06547  [pdf]

    physics.app-ph

    Nonlinearity Modulation of Auto-oscillations in Three-terminal Magnetic Tunnel Junctions

    Authors: Zixi Wang, Wenlong Cai, Ao Du, Zanhong Chen, Lei Zhou, Shiyang Lu, Kewen Shi, Weisheng Zhao

    Abstract: Spin torque nano-oscillators (STNOs) hold encouraging promise for nanoscale microwave generators, modulators, and new types of intelligent computing. The nonlinearity, describing the current-induced tunability of oscillating frequency, is a distinctive feature of STNOs, which plays important roles in efficient manipulation of microwave frequencies, rapid spectrum analysis, and the design of neuro…

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 7 pages, 4 figures

  11. arXiv:2505.03770  [pdf, other]

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin, et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind, held at AAAI 2025 in Philadelphia, US, on 3rd March 2025. The purpose of this volume is to provide an open-access, curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  12. arXiv:2504.20353  [pdf]

    physics.geo-ph

    Study on impact mechanism and precursor information induced by high intensity mining

    Authors: Kaiwen Shi, Wenhao Shi, Shankun Zhao, Hongfei Duan, Yuwei Li, Haojie Xue, Xueyi Shang, Wengang Dang, Peng Li, Yunfei Zhang, Binghuo Guan, Xiang Ma, Hongke Gao

    Abstract: With heightened mining intensity, the incidence of coal bursts is escalating, necessitating advanced understanding and prediction techniques. This research delves into the intricacies of coal burst mechanisms, proposing a novel theoretical model for the release of coal mass energy founded on the tenets of stress superposition. A significant revelation is that the energy culminating in a coal burst…

    Submitted 28 April, 2025; originally announced April 2025.

  13. arXiv:2504.19497  [pdf, ps, other]

    eess.SY cs.LG math.OC

    Negative Imaginary Neural ODEs: Learning to Control Mechanical Systems with Stability Guarantees

    Authors: Kanghong Shi, Ruigang Wang, Ian R. Manchester

    Abstract: We propose a neural control method to provide guaranteed stabilization for mechanical systems using a novel negative imaginary neural ordinary differential equation (NINODE) controller. Specifically, we employ neural networks with desired properties as state-space function matrices within a Hamiltonian framework to ensure the system possesses the NI property. This NINODE system can serve as a cont…

    Submitted 28 April, 2025; originally announced April 2025.

  14. arXiv:2504.19295  [pdf, other]

    cs.CV

    FusionNet: Multi-model Linear Fusion Framework for Low-light Image Enhancement

    Authors: Kangbiao Shi, Yixu Feng, Tao Hu, Yu Cao, Peng Wu, Yijin Liang, Yanning Zhang, Qingsen Yan

    Abstract: The advent of Deep Neural Networks (DNNs) has driven remarkable progress in low-light image enhancement (LLIE), with diverse architectures (e.g., CNNs and Transformers) and color spaces (e.g., sRGB, HSV, HVI) yielding impressive results. Recent efforts have sought to leverage the complementary strengths of these paradigms, offering promising solutions to enhance performance across varying degradat…

    Submitted 27 April, 2025; originally announced April 2025.

  15. arXiv:2504.12225  [pdf, other]

    astro-ph.GA astro-ph.CO

    Narrowband Imaging of a z=3.24 Protocluster: Insights from [O III] Emitting Galaxies

    Authors: Ke Shi, Jun Toshikawa, XianZhong Zheng, Zheng Cai, DongDong Shi

    Abstract: We present a narrowband imaging on a spectroscopically confirmed protocluster "D4UD01" at z=3.24 using CFHT/WIRCam. We identify a sample of 24 [O III] emission line galaxies in the field, which forms a large overdensity in the protocluster region. The protocluster is expected to evolve into a Virgo-like cluster by z=0. Utilizing multiwavelength data, we derive the physical properties of these [O…

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 17 pages, 9 figures, accepted for publication in ApJ

  16. arXiv:2504.08486  [pdf]

    cs.HC

    PlugSelect: Pruning Channels with Plug-and-Play Flexibility for Electroencephalography-based Brain Computer Interface

    Authors: Xue Yuan, Keren Shi, Ning Jiang, Jiayuan He

    Abstract: Automatic minimization and optimization of the number of the electrodes is essential for the practical application of electroencephalography (EEG)-based brain computer interface (BCI). Previous methods typically require additional training costs or rely on prior knowledge assumptions. This study proposed a novel channel pruning model, plug-and-select (PlugSelect), applicable across a broad range o…

    Submitted 11 April, 2025; originally announced April 2025.

  17. arXiv:2504.04065  [pdf, ps, other]

    cs.CV cs.IR cs.MM

    Enabling Collaborative Parametric Knowledge Calibration for Retrieval-Augmented Vision Question Answering

    Authors: Jiaqi Deng, Kaize Shi, Zonghan Wu, Huan Huo, Dingxian Wang, Guandong Xu

    Abstract: Knowledge-based Vision Question Answering (KB-VQA) systems address complex visual-grounded questions with knowledge retrieved from external knowledge bases. The tasks of knowledge retrieval and answer generation both necessitate precise multimodal understanding of question context and external knowledge. However, existing methods treat these two stages as separate modules with limited intera…

    Submitted 30 June, 2025; v1 submitted 5 April, 2025; originally announced April 2025.

    Comments: 10 pages, 5 figures, Under Review

  18. arXiv:2504.03753  [pdf, other]

    cs.LG stat.ME

    MMCE: A Framework for Deep Monotonic Modeling of Multiple Causal Effects

    Authors: Juhua Chen, Karson Shi, Jialing He, North Chen, Kele Jiang

    Abstract: When we plan to use money as an incentive to change the behavior of a person (such as making riders deliver more orders or making consumers buy more items), the common approach to this problem is to adopt a two-stage framework in order to maximize ROI under cost constraints. In the first stage, the individual price response curve is obtained. In the second stage, business goals and resource…

    Submitted 1 April, 2025; originally announced April 2025.

  19. arXiv:2503.20349  [pdf, ps, other]

    cs.CV

    Consistency Trajectory Matching for One-Step Generative Super-Resolution

    Authors: Weiyi You, Mingyang Zhang, Leheng Zhang, Xingyu Zhou, Kexuan Shi, Shuhang Gu

    Abstract: Current diffusion-based super-resolution (SR) approaches achieve commendable performance at the cost of high inference overhead. Therefore, distillation techniques are utilized to accelerate the multi-step teacher model into one-step student model. Nevertheless, these methods significantly raise training costs and constrain the performance of the student model by the teacher model. To overcome the…

    Submitted 30 June, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

  20. arXiv:2503.18512  [pdf, other]

    cs.CV eess.IV

    Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model

    Authors: Leheng Zhang, Weiyi You, Kexuan Shi, Shuhang Gu

    Abstract: Diffusion-based image super-resolution methods have demonstrated significant advantages over GAN-based approaches, particularly in terms of perceptual quality. Building upon a lengthy Markov chain, diffusion-based methods possess remarkable modeling capacity, enabling them to achieve outstanding performance in real-world scenarios. Unlike previous methods that focus on modifying the noise schedule…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025

  21. arXiv:2503.18363  [pdf, other]

    cs.CV

    MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction

    Authors: Wenyuan Zhang, Yixiao Yang, Han Huang, Liang Han, Kanle Shi, Yu-Shen Liu, Zhizhong Han

    Abstract: Monocular depth priors have been widely adopted by neural rendering in multi-view based tasks such as 3D reconstruction and novel view synthesis. However, due to the inconsistent prediction on each view, how to more effectively leverage monocular cues in a multi-view context remains a challenge. Current methods treat the entire estimated depth map indiscriminately, and use it as ground truth super…

    Submitted 30 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR 2025. Project page: https://wen-yuan-zhang.github.io/MonoInstance/

  22. arXiv:2503.18361  [pdf, other]

    cs.CV

    NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction

    Authors: Wenyuan Zhang, Emily Yue-ting Jia, Junsheng Zhou, Baorui Ma, Kanle Shi, Yu-Shen Liu, Zhizhong Han

    Abstract: Recently, it has been shown that priors are vital for neural implicit functions to reconstruct high-quality surfaces from multi-view RGB images. However, current priors require large-scale pre-training, and merely provide geometric clues without considering the importance of color. In this paper, we present NeRFPrior, which adopts a neural radiance field as a prior to learn signed distance fields using…

    Submitted 30 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR 2025. Project page: https://wen-yuan-zhang.github.io/NeRFPrior/

  23. arXiv:2503.16635  [pdf, other]

    eess.IV cs.CV

    Fed-NDIF: A Noise-Embedded Federated Diffusion Model For Low-Count Whole-Body PET Denoising

    Authors: Yinchi Zhou, Huidong Xie, Menghua Xia, Qiong Liu, Bo Zhou, Tianqi Chen, Jun Hou, Liang Guo, Xinyuan Zheng, Hanzhong Wang, Biao Li, Axel Rominger, Kuangyu Shi, Nicha C. Dvorneka, Chi Liu

    Abstract: Low-count positron emission tomography (LCPET) imaging can reduce patients' exposure to radiation but often suffers from increased image noise and reduced lesion detectability, necessitating effective denoising techniques. Diffusion models have shown promise in LCPET denoising for recovering degraded image quality. However, training such models requires large and diverse datasets, which are challe…

    Submitted 20 March, 2025; originally announced March 2025.

  24. Positronium Imaging: History, Current Status, and Future Perspectives

    Authors: Paweł Moskal, Aleksander Bilewicz, Manish Das, Bangyan Huang, Aleksander Khreptak, Szymon Parzych, Jinyi Qi, Axel Rominger, Robert Seifert, Sushil Sharma, Kuangyu Shi, William Steinberger, Rafał Walczak, Ewa Stępień

    Abstract: Positronium imaging was recently proposed to image the properties of positronium atoms in the patient body. Positronium properties depend on the size of intramolecular voids and oxygen concentration; therefore, they deliver information different and complementary to the anatomic, morphological, and metabolic images. Thus far, the mean ortho-positronium lifetime imaging has been at the center of re…

    Submitted 1 July, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: This manuscript has been approved for publication and is available in Early Access in IEEE Transactions on Radiation and Plasma Medical Sciences

  25. arXiv:2503.12045  [pdf, ps, other]

    stat.ME cs.CR cs.LG

    Auditing Differential Privacy in the Black-Box Setting

    Authors: Kaining Shi, Cong Ma

    Abstract: This paper introduces a novel theoretical framework for auditing differential privacy (DP) in a black-box setting. Leveraging the concept of $f$-differential privacy, we explicitly define type I and type II errors and propose an auditing mechanism based on conformal inference. Our approach robustly controls the type I error rate under minimal assumptions. Furthermore, we establish a fundamental im…

    Submitted 10 April, 2025; v1 submitted 15 March, 2025; originally announced March 2025.

    Comments: work in progress, comments are welcomed

  26. arXiv:2503.08703  [pdf, ps, other]

    cs.NE cs.CV

    SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks

    Authors: Yimeng Shan, Zhenbang Ren, Haodi Wu, Wenjie Wei, Rui-Jie Zhu, Shuai Wang, Dehao Zhang, Yichen Xiao, Jieyuan Zhang, Kexin Shi, Jingzhinan Wang, Jason K. Eshraghian, Haicheng Qu, Jiqing Zhang, Malu Zhang, Yang Yang

    Abstract: Event cameras provide superior temporal resolution, dynamic range, power efficiency, and pixel bandwidth. Spiking Neural Networks (SNNs) naturally complement event data through discrete spike signals, making them ideal for event-based tracking. However, current approaches that combine Artificial Neural Networks (ANNs) and SNNs, along with suboptimal architectures, compromise energy efficiency and…

    Submitted 17 June, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: 11 pages, 7 figures, 4 tables

  27. arXiv:2503.05367  [pdf, other]

    physics.med-ph cs.LG

    Semi-Supervised Learning for Dose Prediction in Targeted Radionuclide: A Synthetic Data Study

    Authors: Jing Zhang, Alexandre Bousse, Laetitia Imbert, Song Xue, Kuangyu Shi, Julien Bert

    Abstract: Targeted Radionuclide Therapy (TRT) is a modern strategy in radiation oncology that aims to administer a potent radiation dose specifically to cancer cells using cancer-targeting radiopharmaceuticals. Accurate radiation dose estimation tailored to individual patients is crucial. Deep learning, particularly with pre-therapy imaging, holds promise for personalizing TRT doses. However, current method…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 12 pages, 13 figures, 5 tables

  28. arXiv:2503.03441  [pdf]

    cond-mat.supr-con

    Geometric Asymmetry-Enhanced Nonreciprocal Supercurrent Transport Revealed by Second-Harmonic Response

    Authors: Yu He, Zifeng Wang, Jiaxu Li, Fenglin Zhong, Haozhe Yang, Kewen Shi, Le Wang, Guang Yang, Weisheng Zhao

    Abstract: Nonreciprocal transport in superconducting systems serves as a powerful probe of symmetry-breaking mechanisms, with the superconducting diode effect emerging as a key manifestation enabling cryogenic rectification. While theoretical models have extensively explored superconducting nonreciprocity, experimental verification remains challenging, as conventional transport measurements struggle to dise…

    Submitted 12 April, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 17 pages, 4 figures, v2

    Journal ref: Adv. Funct. Mater. 2025, 2505766

  29. arXiv:2503.02824  [pdf, other]

    cs.CV cs.AI

    Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging

    Authors: Yujin Oh, Robert Seifert, Yihan Cao, Christoph Clement, Justin Ferdinandus, Constantin Lapa, Alessandro Liebich, Michelle Amon, Johanna Enke, Sifan Song, Runqi Meng, Fang Zeng, Ning Guo, Xiang Li, Pedram Heidari, Axel Rominger, Kuangyu Shi, Quanzheng Li

    Abstract: In oncology, Positron Emission Tomography-Computed Tomography (PET/CT) is widely used in cancer diagnosis, staging, and treatment monitoring, as it combines anatomical details from CT with functional metabolic activity and molecular marker expression information from PET. However, existing artificial intelligence-driven PET/CT analyses rely predominantly on task-specific models trained from scratc…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 11 pages, 2 figures, 3 tables

  30. arXiv:2502.21260  [pdf, other]

    eess.IV

    PET Image Denoising via Text-Guided Diffusion: Integrating Anatomical Priors through Text Prompts

    Authors: Boxiao Yu, Savas Ozdemir, Jiong Wu, Yizhou Chen, Ruogu Fang, Kuangyu Shi, Kuang Gong

    Abstract: Low-dose Positron Emission Tomography (PET) imaging presents a significant challenge due to increased noise and reduced image quality, which can compromise its diagnostic accuracy and clinical utility. Denoising diffusion probabilistic models (DDPMs) have demonstrated promising performance for PET image denoising. However, existing DDPM-based methods typically overlook valuable metadata such as pa…

    Submitted 28 February, 2025; originally announced February 2025.

  31. arXiv:2502.20272  [pdf, other]

    cs.CV cs.AI cs.LG

    HVI: A New Color Space for Low-light Image Enhancement

    Authors: Qingsen Yan, Yixu Feng, Cheng Zhang, Guansong Pang, Kangbiao Shi, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang

    Abstract: Low-Light Image Enhancement (LLIE) is a crucial computer vision task that aims to restore detailed visual information from corrupted low-light images. Many existing LLIE methods are based on standard RGB (sRGB) space, which often produce color bias and brightness artifacts due to inherent high color sensitivity in sRGB. While converting the images using Hue, Saturation and Value (HSV) color space…

    Submitted 28 February, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    Comments: Qingsen Yan, Yixu Feng, and Cheng Zhang contributed equally to this work

  32. arXiv:2502.16479  [pdf, other]

    quant-ph physics.atm-clus

    Enhanced response at exceptional points in multi-qubit systems for sensing

    Authors: Tingting Shi, Vasilii Smirnov, Kaiye Shi, Wei Zhang

    Abstract: Exceptional points featuring enhanced energy response to perturbation hold significant potential in detection and measurement of weak signals. Of particular interest is the existence and property of high-order exceptional points in quantum systems, owing to the capability to provide high-order response to perturbations. We investigate the exceptional points in a system of $n$ identical qubits poss…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 9 pages, 4 figures

    Journal ref: Phys. Rev. A 111, 032203 (2025)

  33. arXiv:2502.11338  [pdf, other]

    cs.CV

    WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing

    Authors: Yunyi Zhou, Kun Shi, Gang Hao

    Abstract: Radiographic testing is a fundamental non-destructive evaluation technique for identifying weld defects and assessing quality in industrial applications due to its high-resolution imaging capabilities. Over the past decade, deep learning techniques have significantly advanced weld defect identification in radiographic images. However, conventional approaches, which rely on training small-scale, ta…

    Submitted 16 February, 2025; originally announced February 2025.

  34. arXiv:2502.10608  [pdf, other]

    cs.CV cs.LG

    Universal Lesion Segmentation Challenge 2023: A Comparative Research of Different Algorithms

    Authors: Kaiwen Shi, Yifei Li, Binh Ho, Jovian Wang, Kobe Guo

    Abstract: In recent years, machine learning algorithms have achieved much success in segmenting lesions across various tissues. There is, however, not one satisfying model that works well on all tissue types universally. In response to this need, we attempt to train a model that 1) works well on all tissue types, and 2) is capable of still performing fast inferences. To this end, we design our architectures…

    Submitted 14 February, 2025; originally announced February 2025.

  35. arXiv:2502.10607  [pdf, other]

    math.OC

    Time Parameterized Optimal Transport

    Authors: Kaiwen Shi

    Abstract: Optimal transport has gained significant attention in recent years due to its effectiveness in deep learning and computer vision. Its descendant metric, the Wasserstein distance, has been particularly successful in measuring distribution dissimilarities. While extensive research has focused on optimal transport and its regularized variants (such as entropy, sparsity, and capacity constraints) the…

    Submitted 14 February, 2025; originally announced February 2025.

  36. arXiv:2502.00681  [pdf, other]

    cs.LG cs.AI cs.CL

    A Survey of Quantized Graph Representation Learning: Connecting Graph Structures with Large Language Models

    Authors: Qika Lin, Zhen Peng, Kaize Shi, Kai He, Yiming Xu, Erik Cambria, Mengling Feng

    Abstract: Recent years have witnessed rapid advances in graph representation learning, with the continuous embedding approach emerging as the dominant paradigm. However, such methods encounter issues regarding parameter efficiency, interpretability, and robustness. Thus, Quantized Graph Representation (QGR) learning has recently gained increasing interest, which represents the graph structure with discrete…

    Submitted 2 February, 2025; originally announced February 2025.

  37. arXiv:2501.14367  [pdf, other]

    cs.NI eess.SP

    Joint System Latency and Data Freshness Optimization for Cache-enabled Mobile Crowdsensing Networks

    Authors: Kexin Shi, Yaru Fu, Yongna Guo, Fu Lee Wang, Yan Zhang

    Abstract: Mobile crowdsensing (MCS) networks enable large-scale data collection by leveraging the ubiquity of mobile devices. However, frequent sensing and data transmission can lead to significant resource consumption. To mitigate this issue, edge caching has been proposed as a solution for storing recently collected data. Nonetheless, this approach may compromise data freshness. In this paper, we investig…

    Submitted 24 January, 2025; originally announced January 2025.

  38. arXiv:2501.14179  [pdf, other]

    cs.HC

    AI Chatbots as Professional Service Agents: Developing a Professional Identity

    Authors: Wenwen Li, Kangwei Shi, Yidong Chai

    Abstract: With the rapid expansion of large language model (LLM) applications, there is an emerging shift in the role of LLM-based AI chatbots from serving merely as general inquiry tools to acting as professional service agents. However, current studies often overlook a critical aspect of professional service agents: the act of communicating in a manner consistent with their professional identities. This i…

    Submitted 23 January, 2025; originally announced January 2025.

  39. arXiv:2501.04145  [pdf, other]

    physics.med-ph

    Positronium Lifetime Imaging with the Biograph Vision Quadra using 124I

    Authors: Lorenzo Mercolli, William M. Steinberger, Narendra Rathod, Maurizio Conti, Paweł Moskal, Axel Rominger, Robert Seifert, Kuangyu Shi, Ewa Ł. Stępień, Hasan Sari

    Abstract: Purpose: Measuring the ortho-positronium (oPs) lifetime in human tissue bears the potential of adding clinically relevant information about the tissue microenvironment to conventional positron emission tomography (PET). Through phantom measurements, we investigate the voxel-wise measurement of oPs lifetime using a commercial long-axial field-of-view (LAFOV) PET scanner. Methods: We prepared four s…

    Submitted 7 January, 2025; originally announced January 2025.

  40. arXiv:2501.03571  [pdf]

    cs.LG cs.SD eess.AS q-bio.NC

    AADNet: Exploring EEG Spatiotemporal Information for Fast and Accurate Orientation and Timbre Detection of Auditory Attention Based on A Cue-Masked Paradigm

    Authors: Keren Shi, Xu Liu, Xue Yuan, Haijie Shang, Ruiting Dai, Hanbin Wang, Yunfa Fu, Ning Jiang, Jiayuan He

    Abstract: Auditory attention decoding from electroencephalogram (EEG) could infer to which source the user is attending in noisy environments. Decoding algorithms and experimental paradigm designs are crucial for the development of technology in practical applications. To simulate real-world scenarios, this study proposed a cue-masked auditory attention paradigm to avoid information leakage before the exper…

    Submitted 7 January, 2025; originally announced January 2025.

  41. Evaluation of Deep Learning-based Scatter Correction on a Long-axial Field-of-view PET scanner

    Authors: Baptiste Laurent, Alexandre Bousse, Thibaut Merlin, Axel Rominger, Kuangyu Shi, Dimitris Visvikis

    Abstract: Objective: Long-axial field-of-view (LAFOV) positron emission tomography (PET) systems allow higher sensitivity, with an increased number of detected lines of response induced by a larger angle of acceptance. However, this extended angle increases the number of multiple scatters and the scatter contribution within oblique planes. As scattering affects both quality and quantification of the reconst…

    Submitted 7 February, 2025; v1 submitted 2 January, 2025; originally announced January 2025.

    Comments: 15 pages, 10 figures, 3 tables

  42. arXiv:2412.16720  [pdf, other]

    cs.AI

    OpenAI o1 System Card

    Authors: OpenAI, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksander Madry, Alex Beutel, Alex Carney, Alex Iftimie, Alex Karpenko, Alex Tachard Passos, Alexander Neitz, Alexander Prokofiev, Alexander Wei, Allison Tam, Ally Bennett, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Duberstein, Andrew Kondrich, et al. (238 additional authors not shown)

    Abstract: The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts, through deliberative alignment. This leads to state-of-the-ar…

    Submitted 21 December, 2024; originally announced December 2024.

  43. arXiv:2412.06833  [pdf]

    cs.LG cs.AI cs.SI

    Detecting Fake News on Social Media: A Novel Reliability Aware Machine-Crowd Hybrid Intelligence-Based Method

    Authors: Yidong Chai, Kangwei Shi, Jiaheng Xie, Chunli Liu, Yuanchun Jiang, Yezheng Liu

    Abstract: Fake news on social media platforms poses a significant threat to societal systems, underscoring the urgent need for advanced detection methods. The existing detection methods can be divided into machine intelligence-based, crowd intelligence-based, and hybrid intelligence-based methods. Among them, hybrid intelligence-based methods achieve the best performance but fail to consider the reliability…

    Submitted 6 December, 2024; originally announced December 2024.

  44. arXiv:2412.02086  [pdf, ps, other]

    cond-mat.quant-gas cond-mat.mes-hall quant-ph

    Anomalous wave-packet transport on boundaries of Floquet topological systems

    Authors: Xin-Xin Yang, Kai-Ye Shi, F. Nur Ünal, Wei Zhang

    Abstract: A two-dimensional periodically driven (Floquet) system with zero winding number in the absence of time-reversal symmetry is usually considered topologically trivial. Here, we study the dynamics of a Gaussian wave packet placed at the boundary of a two-dimensional driven system with zero winding numbers but multiple valley-protected edge states that can be realized in a square Raman lattice, and in…

    Submitted 4 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

    Journal ref: Phys. Rev. Research 7, 023077 (2025)

  45. arXiv:2411.16569  [pdf, other]

    q-fin.PM q-fin.CP

    Predictive Power of LLMs in Financial Markets

    Authors: Jerick Shi, Burton Hollifield

    Abstract: Predicting the movement of the stock market and other assets has been valuable over the past few decades. Knowing how the value of a certain sector market may move in the future provides much information for investors, as they use that information to develop strategies to maximize profit or minimize risk. However, market data are quite noisy, and it is challenging to choose the right data or the r…

    Submitted 25 November, 2024; originally announced November 2024.

  46. arXiv:2411.12601  [pdf, ps, other]

    math.NA cs.LG

    Hypergraph $p$-Laplacian equations for data interpolation and semi-supervised learning

    Authors: Kehan Shi, Martin Burger

    Abstract: Hypergraph learning with $p$-Laplacian regularization has attracted a lot of attention due to its flexibility in modeling higher-order relationships in data. This paper focuses on its fast numerical implementation, which is challenging due to the non-differentiability of the objective function and the non-uniqueness of the minimizer. We derive a hypergraph $p$-Laplacian equation from the subdiffer…

    Submitted 7 April, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

    Comments: 17 pages

    MSC Class: 35R02; 65D05

  47. arXiv:2411.09363  [pdf, other]

    eess.IV

    When Mamba Meets xLSTM: An Efficient and Precise Method with the xLSTM-VMUNet Model for Skin lesion Segmentation

    Authors: Zhuoyi Fang, Jiajia Liu, Kexuan Shi, Qiang Han

    Abstract: Automatic melanoma segmentation is essential for early skin cancer detection, yet challenges arise from the heterogeneity of melanoma, as well as interfering factors like blurred boundaries, low contrast, and imaging artifacts. While numerous algorithms have been developed to address these issues, previous approaches have often overlooked the need to jointly capture spatial and sequential features…

    Submitted 12 March, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

  48. arXiv:2411.06055  [pdf, other]

    cs.LG math.MG

    Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data

    Authors: Xinran Liu, Yikun Bai, Rocío Díaz Martín, Kaiwen Shi, Ashkan Shahbazi, Bennett A. Landman, Catie Chang, Soheil Kolouri

    Abstract: Efficient comparison of spherical probability distributions becomes important in fields such as computer vision, geosciences, and medicine. Sliced optimal transport distances, such as spherical and stereographic spherical sliced Wasserstein distances, have recently been developed to address this need. These methods reduce the computational burden of optimal transport by slicing hyperspheres into o…

    Submitted 8 November, 2024; originally announced November 2024.

  49. arXiv:2411.01733  [pdf, ps, other]

    cond-mat.mes-hall physics.app-ph

    Dependence of Electrostatic Patch Force Evaluation on the Lateral Resolution of Kelvin Probe Force Microscopy

    Authors: Kun Shi, Pengshun Luo, Jinquan Liu, Hang Yin, Zebing Zhou

    Abstract: Kelvin Probe Force Microscopy (KPFM) is widely used to measure the surface potential on samples, from which electrostatic patch force can be calculated. However, since the KPFM measurements represent a weighted average of local potentials on the sample, the accuracy of the evaluation critically depends on the precision and lateral resolution of the method. In this paper, we investigate the influen…

    Submitted 3 November, 2024; originally announced November 2024.

    Journal ref: Phys. Rev. D 110, 122007 (2024)

  50. arXiv:2410.21276  [pdf, other]

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis, et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil…

    Submitted 25 October, 2024; originally announced October 2024.