Skip to main content

Showing 1–50 of 175 results for author: Yue, J

.
  1. arXiv:2506.02587  [pdf, other

    cs.CV cs.RO

    BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations

    Authors: Weiduo Yuan, Jerry Li, Justin Yue, Divyank Shah, Konstantinos Karydis, Hang Qiu

    Abstract: Accurate LiDAR-camera calibration is fundamental to fusing multi-modal perception in autonomous driving and robotic systems. Traditional calibration methods require extensive data collection in controlled environments and cannot compensate for the transformation changes during the vehicle/robot movement. In this paper, we propose the first model that uses bird's-eye view (BEV) features to perform… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2505.23014  [pdf, ps, other

    cs.LG

    Hyperbolic-PDE GNN: Spectral Graph Neural Networks in the Perspective of A System of Hyperbolic Partial Differential Equations

    Authors: Juwei Yue, Haikuo Li, Jiawei Sheng, Xiaodong Li, Taoyu Su, Tingwen Liu, Li Guo

    Abstract: Graph neural networks (GNNs) leverage message passing mechanisms to learn the topological features of graph data. Traditional GNNs learns node features in a spatial domain unrelated to the topology, which can hardly ensure topological features. In this paper, we formulates message passing as a system of hyperbolic partial differential equations (hyperbolic PDEs), constituting a dynamical system th… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 18 pages, 2 figures, published to ICML 2025

    Journal ref: International Conference on Machine Learning 2025

  3. Graph Wave Networks

    Authors: Juwei Yue, Haikuo Li, Jiawei Sheng, Yihan Guo, Xinghua Zhang, Chuan Zhou, Tingwen Liu, Li Guo

    Abstract: Dynamics modeling has been introduced as a novel paradigm in message passing (MP) of graph neural networks (GNNs). Existing methods consider MP between nodes as a heat diffusion process, and leverage heat equation to model the temporal evolution of nodes in the embedding space. However, heat equation can hardly depict the wave nature of graph signals in graph signal processing. Besides, heat equat… ▽ More

    Submitted 28 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 15 pages, 8 figures, published to WWW 2025

    Journal ref: The ACM Web Conference 2025

  4. arXiv:2505.18401  [pdf, ps, other

    cs.CV

    Recent Deep Learning in Crowd Behaviour Analysis: A Brief Review

    Authors: Jiangbei Yue, He Wang

    Abstract: Crowd behaviour analysis is essential to numerous real-world applications, such as public safety and urban planning, and therefore has been studied for decades. In the last decade or so, the development of deep learning has significantly propelled the research on crowd behaviours. This chapter reviews recent advances in crowd behaviour analysis using deep learning. We mainly review the research in… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 51 pages, 7 figures, Book Chapter

  5. arXiv:2505.11852  [pdf, other

    cs.CV

    MedSG-Bench: A Benchmark for Medical Image Sequences Grounding

    Authors: Jingkun Yue, Siqi Zhang, Zinan Jia, Huihuan Xu, Zongbo Han, Xiaohong Liu, Guangyu Wang

    Abstract: Visual grounding is essential for precise perception and reasoning in multimodal large language models (MLLMs), especially in medical imaging domains. While existing medical visual grounding benchmarks primarily focus on single-image scenarios, real-world clinical applications often involve sequential images, where accurate lesion localization across different modalities and temporal tracking of d… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  6. arXiv:2505.05599  [pdf, ps, other

    cs.CV cs.AI

    Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling

    Authors: Seraj Al Mahmud Mostafa, Chenxi Wang, Jia Yue, Yuta Hozumi, Jianwu Wang

    Abstract: Object localization in satellite imagery is particularly challenging due to the high variability of objects, low spatial resolution, and interference from noise and dominant features such as clouds and city lights. In this research, we focus on three satellite datasets: upper atmospheric Gravity Waves (GW), mesospheric Bores (Bore), and Ocean Eddies (OE), each presenting its own unique challenges.… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted to International conference on Advanced Machine Learning and Data Science (AMLDS) 2025

  7. arXiv:2505.02135  [pdf, other

    cond-mat.mtrl-sci

    Diffuson-Dominated Thermal Transport Crossover from Ordered to Liquid-like Cu$_3$BiS$_3$:The Negligible Role of Ion Hopping

    Authors: Jincheng Yue, Jiongzhi Zheng, Xingchen Shen, Chun-Chuen Yang, Shuyao Lin, Yanhui Liu, Tian Cui

    Abstract: Fundamentally understanding lattice dynamics and thermal transport behavior in liquid-like, partially occupied compounds remains a long-standing challenge in condensed matter physics. Here, we investigate the microscopic mechanisms underlying the ultralow thermal conductivity in ordered/liquid-like Cu$_3$BiS$_3$ by combining experimental methods with first-principles calculations. We first experim… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  8. arXiv:2505.01168  [pdf, other

    cs.LG cs.AI

    Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability

    Authors: Zhaoyang Ma, Zhihao Wu, Wang Lu, Xin Gao, Jinghang Yue, Taolin Zhang, Lipo Wang, Youfang Lin, Jing Wang

    Abstract: The development of model ensemble attacks has significantly improved the transferability of adversarial examples, but this progress also poses severe threats to the security of deep neural networks. Existing methods, however, face two critical challenges: insufficient capture of shared gradient directions across models and a lack of adaptive weight allocation mechanisms. To address these issues, w… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  9. arXiv:2504.20303  [pdf, other

    cs.CV

    DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes

    Authors: Junlin Guo, James R. Zimmer-Dauphinee, Jordan M. Nieusma, Siqi Lu, Quan Liu, Ruining Deng, Can Cui, Jialin Yue, Yizhe Lin, Tianyuan Yao, Juming Xiong, Junchao Zhu, Chongyu Qu, Yuechen Yang, Mitchell Wilkes, Xiao Wang, Parker VanValkenburgh, Steven A. Wernke, Yuankai Huo

    Abstract: By mapping sites at large scales using remotely sensed data, archaeologists can generate unique insights into long-term demographic trends, inter-regional social networks, and past adaptations to climate change. Remote sensing surveys complement field-based approaches, and their reach can be especially great when combined with deep learning and computer vision techniques. However, conventional sup… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  10. arXiv:2504.19458  [pdf, other

    cs.MM cs.CL cs.IR

    Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective

    Authors: Taoyu Su, Jiawei Sheng, Duohe Ma, Xiaodong Li, Juwei Yue, Mengxiao Song, Yingkai Tang, Tingwen Liu

    Abstract: Multi-Modal Entity Alignment (MMEA) aims to retrieve equivalent entities from different Multi-Modal Knowledge Graphs (MMKGs), a critical information retrieval task. Existing studies have explored various fusion paradigms and consistency constraints to improve the alignment of equivalent entities, while overlooking that the visual modality may not always contribute positively. Empirically, entities… ▽ More

    Submitted 15 May, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

    Comments: Accepted by SIGIR 2025, 11 pages, 10 figures, 4 tables,

  11. arXiv:2504.16516  [pdf, other

    cs.CV cs.AI

    Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation

    Authors: Junrong Yue, Yifan Zhang, Chuan Qin, Bo Li, Xiaomin Lie, Xinlei Yu, Wenxin Zhang, Zhendong Zhao

    Abstract: Vision-and-Language Navigation (VLN) aims to enable embodied agents to follow natural language instructions and reach target locations in real-world environments. While prior methods often rely on either global scene representations or object-level features, these approaches are insufficient for capturing the complex interactions across modalities required for accurate navigation. In this paper, w… ▽ More

    Submitted 24 April, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: 11 pages, 4 figures, Submitted to ACM MM 2025

  12. arXiv:2503.17418  [pdf

    q-bio.GN

    Application of Single-cell Deep Learning in Elucidating the Mapping Relationship Between Visceral and Body Surface Inflammatory Patterns

    Authors: Haixiang Huang, Bingbing Shen, Zhenwei Zhang, Jianming Yue, Lu Mei, Qiusheng Chen

    Abstract: As a system of integrated homeostasis, life is susceptible to disruptions by visceral inflammation, which can disturb internal environment equilibrium. The role of body-spread subcutaneous fascia (scFascia) in this process is poorly understood. In the rat model of Salmonella-induced dysentery, scRNA-seq of scFascia and deep-learning analysis revealed Warburg-like metabolic reprogramming in macroph… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 25pages, 7 figures, under review

  13. arXiv:2503.12168  [pdf, other

    cs.CV

    Learning Extremely High Density Crowds as Active Matters

    Authors: Feixiang He, Jiangbei Yue, Jialin Zhu, Armin Seyfried, Dan Casas, Julien Pettré, He Wang

    Abstract: Video-based high-density crowd analysis and prediction has been a long-standing topic in computer vision. It is notoriously difficult due to, but not limited to, the lack of high-quality data and complex crowd dynamics. Consequently, it has been relatively under studied. In this paper, we propose a new approach that aims to learn from in-the-wild videos, often with low quality where it is difficul… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR 2025

  14. arXiv:2503.10148  [pdf, other

    cs.CV

    3D Student Splatting and Scooping

    Authors: Jialin Zhu, Jiangbei Yue, Feixiang He, He Wang

    Abstract: Recently, 3D Gaussian Splatting (3DGS) provides a new framework for novel view synthesis, and has spiked a new wave of research in neural rendering and related applications. As 3DGS is becoming a foundational component of many models, any improvement on 3DGS itself can bring huge benefits. To this end, we aim to improve the fundamental paradigm and formulation of 3DGS. We argue that as an unnormal… ▽ More

    Submitted 11 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  15. arXiv:2502.13071  [pdf, other

    cs.CV

    RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection

    Authors: Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang

    Abstract: While recent low-cost radar-camera approaches have shown promising results in multi-modal 3D object detection, both sensors face challenges from environmental and intrinsic disturbances. Poor lighting or adverse weather conditions degrade camera performance, while radar suffers from noise and positional ambiguity. Achieving robust radar-camera 3D object detection requires consistent performance ac… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: Accepted by ICLR2025

  16. arXiv:2502.12167  [pdf

    cs.LG cs.AI

    TastepepAI, An artificial intelligence platform for taste peptide de novo design

    Authors: Jianda Yue, Tingting Li, Jian Ouyang, Jiawei Xu, Hua Tan, Zihui Chen, Changsheng Han, Huanyu Li, Songping Liang, Zhonghua Liu, Zhonghua Liu, Ying Wang

    Abstract: Taste peptides have emerged as promising natural flavoring agents attributed to their unique organoleptic properties, high safety profile, and potential health benefits. However, the de novo identification of taste peptides derived from animal, plant, or microbial sources remains a time-consuming and resource-intensive process, significantly impeding their widespread application in the food indust… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 40 pages, 6 figures, research article

  17. arXiv:2502.05330  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge

    Authors: Muhammad Imran, Jonathan R. Krebs, Vishal Balaji Sivaraman, Teng Zhang, Amarjeet Kumar, Walker R. Ueland, Michael J. Fassler, Jinlong Huang, Xiao Sun, Lisheng Wang, Pengcheng Shi, Maximilian Rokuss, Michael Baumgartner, Yannick Kirchhof, Klaus H. Maier-Hein, Fabian Isensee, Shuolin Liu, Bing Han, Bong Thanh Nguyen, Dong-jin Shin, Park Ji-Woo, Mathew Choi, Kwang-Hyun Uhm, Sung-Jea Ko, Chanwoong Lee , et al. (38 additional authors not shown)

    Abstract: Multi-class segmentation of the aorta in computed tomography angiography (CTA) scans is essential for diagnosing and planning complex endovascular treatments for patients with aortic dissections. However, existing methods reduce aortic segmentation to a binary problem, limiting their ability to measure diameters across different branches and zones. Furthermore, no open-source dataset is currently… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  18. arXiv:2502.02277  [pdf, other

    cs.LG cs.AI

    Error Distribution Smoothing:Advancing Low-Dimensional Imbalanced Regression

    Authors: Donghe Chen, Jiaxuan Yue, Tengjie Zheng, Lanxuan Wang, Lin Cheng

    Abstract: In real-world regression tasks, datasets frequently exhibit imbalanced distributions, characterized by a scarcity of data in high-complexity regions and an abundance in low-complexity areas. This imbalance presents significant challenges for existing classification methods with clear class boundaries, while highlighting a scarcity of approaches specifically designed for imbalanced regression probl… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 16 pages, 12 figures

  19. arXiv:2412.03198  [pdf, ps, other

    math.AP

    The interaction between rough vortex patch and boundary layer

    Authors: Jingchi Huang, Chao Wang, Jingchao Yue, Zhifei Zhang

    Abstract: In this paper, we study the asymptotic behavior of the solution of the Navier-Stokes equations in the half plane at high Reynolds number regime, when the initial vorticity belongs to the Yudovich class and is supported away from the boundary. We prove the $L^p$ ($2\leq p< \infty$) convergence from the Naiver-Stokes equations to the Euler equations. The key point is to introduce a good functional f… ▽ More

    Submitted 4 December, 2024; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: 30 pages. First submission on September 11th

  20. arXiv:2411.18795  [pdf, other

    cs.CV

    GloFinder: AI-empowered QuPath Plugin for WSI-level Glomerular Detection, Visualization, and Curation

    Authors: Jialin Yue, Tianyuan Yao, Ruining Deng, Siqi Lu, Junlin Guo, Quan Liu, Mengmeng Yin, Juming Xiong, Haichun Yang, Yuankai Huo

    Abstract: Artificial intelligence (AI) has demonstrated significant success in automating the detection of glomeruli, the key functional units of the kidney, from whole slide images (WSIs) in kidney pathology. However, existing open-source tools are often distributed as source code or Docker containers, requiring advanced programming skills that hinder accessibility for non-programmers, such as clinicians.… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  21. arXiv:2411.17390  [pdf, other

    eess.IV cs.CV

    Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance

    Authors: Jingtong Yue, Xin Lin, Zijiu Yang, Chao Ren

    Abstract: No-Reference Image Quality Assessment for distorted images has always been a challenging problem due to image content variance and distortion diversity. Previous IQA models mostly encode explicit single-quality features of synthetic images to obtain quality-aware representations for quality score prediction. However, performance decreases when facing real-world distortion and restored images from… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: 8 pages,6 figures, published to WACV

  22. arXiv:2411.15942  [pdf, other

    eess.IV cs.CV

    Cross-organ Deployment of EOS Detection AI without Retraining: Feasibility and Limitation

    Authors: Yifei Wu, Juming Xiong, Tianyuan Yao, Ruining Deng, Junlin Guo, Jialin Yue, Naweed Chowdhury, Yuankai Huo

    Abstract: Chronic rhinosinusitis (CRS) is characterized by persistent inflammation in the paranasal sinuses, leading to typical symptoms of nasal congestion, facial pressure, olfactory dysfunction, and discolored nasal drainage, which can significantly impact quality-of-life. Eosinophils (Eos), a crucial component in the mucosal immune response, have been linked to disease severity in CRS. The diagnosis of… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

    Comments: 8 pages, 5 figures. Accepted by SPIE Medical Imaging 2025 on October 28, 2024

  23. Bidirectional Optimization onto Thermoelectric Performance via Hydrostatic-Pressure in Chalcopyrite AgXTe2 (X=In, Ga)

    Authors: Siqi Guo, Jincheng Yue, Jiongzhi Zheng, Hui Zhang, Ning Wang, Junda Li, Yanhui Liu, Tian Cui

    Abstract: Pressure tuning has emerged as a powerful strategy for manipulating the thermoelectric properties of materials by inducing structural and electronic modifications. Herein, we systematically investigate the transport properties and thermoelectric performance concerning lattice distortions induced by hydrostatic pressure in Ag-based chalcopyrite AgXTe2 (X=In, Ga). The findings reveal that the lattic… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  24. arXiv:2410.14956  [pdf

    cond-mat.mes-hall cond-mat.soft q-bio.QM

    Airborne Biomarker Localization Engine (ABLE) for Open Air Point-of-Care Detection

    Authors: Jingcheng Ma, Megan Laune, Pengju Li, Jing Lu, Jiping Yue, Yueyue Yu, Jessica Cleary, Kaitlyn Oliphant, Zachary Kessler, Erika C. Claud, Bozhi Tian

    Abstract: Unlike biomarkers in biofluids, airborne biomarkers are dilute and difficult to trace. Detecting diverse airborne biomarkers with sufficient sensitivity typically relies on bulky and expensive equipment like mass spectrometers that remain inaccessible to the general population. Here, we introduce Airborne Biomarker Localization Engine (ABLE), a simple, affordable, and portable platform that can de… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 17 pages, 5 figures. An additional 67-page supplementary materials document containing a detailed description of methods, 15 additional discussions, 30 figures, and 3 tables, will be made available after the manuscript is published after peer-review process

  25. arXiv:2410.03450  [pdf, other

    cs.LG

    MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

    Authors: Junpeng Yue, Xinrun Xu, Börje F. Karlsson, Zongqing Lu

    Abstract: MLLM agents demonstrate potential for complex embodied tasks by retrieving multimodal task-relevant trajectory data. However, current retrieval methods primarily focus on surface-level similarities of textual or visual cues in trajectories, neglecting their effectiveness for the specific task at hand. To address this issue, we propose a novel method, MLLM As ReTriever (MART), which enhances the pe… ▽ More

    Submitted 22 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: ICLR 2025

  26. arXiv:2409.15522  [pdf, ps, other

    math.CO

    Spanning weakly even trees of graphs

    Authors: Jiangdong Ai, M. N. Ellingham, Zhipeng Gao, Yixuan Huang, Xiangzhou Liu, Songling Shan, Simon Špacapan, Jun Yue

    Abstract: Let $G$ be a graph (with multiple edges allowed) and let $T$ be a tree in $G$. We say that $T$ is $\textit{even}$ if every leaf of $T$ belongs to the same part of the bipartition of $T$, and that $T$ is $\textit{weakly even}$ if every leaf of $T$ that has maximum degree in $G$ belongs to the same part of the bipartition of $T$. We confirm two recent conjectures of Jackson and Yoshimoto by showing… ▽ More

    Submitted 17 October, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 6 pages. This article represents a merger of arXiv:2409.15522v1 and arxiv:2408.07056

    MSC Class: 05C05 (Primary) 05C07 (Secondary)

  27. Efficient Cross-layer Thermal Transport with Atypical Glassy-like Phenomena in Crystalline CsCu$_4$Se$_3$

    Authors: Jincheng Yue, Yanhui Liu, Jiongzhi Zheng

    Abstract: Understanding lattice dynamics and thermal transport in crystalline compounds with intrinsically low lattice thermal conductivity ($κ_L$) is crucial in condensed matter physics. In this work, we investigate the lattice thermal conductivity of crystalline CsCu$_4$Se$_3$ by coupling first-principles anharmonic lattice dynamics with a unified theory of thermal transport. We consider the effects of bo… ▽ More

    Submitted 14 November, 2024; v1 submitted 14 September, 2024; originally announced September 2024.

  28. arXiv:2409.07723  [pdf, other

    cs.CV cs.AI

    Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy

    Authors: Bojian Li, Bo Liu, Xinning Yao, Jinghua Yue, Fugen Zhou

    Abstract: Depth estimation is a cornerstone of 3D reconstruction and plays a vital role in minimally invasive endoscopic surgeries. However, most current depth estimation networks rely on traditional convolutional neural networks, which are limited in their ability to capture global information. Foundation models offer a promising approach to enhance depth estimation, but those models currently available ar… ▽ More

    Submitted 5 March, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: 8 pages, 7 figures

  29. arXiv:2408.15464  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics

    Ultrafast symmetry control in photoexcited quantum dots

    Authors: Burak Guzelturk, Joshua Portner, Justin Ondry, Samira Ghanbarzadeh, Mia Tarantola, Ahhyun Jeong, Thomas Field, Alicia M. Chandler, Eliza Wieman, Thomas R. Hopper, Nicolas E. Watkins, Jin Yue, Xinxin Cheng, Ming-Fu Lin, Duan Luo, Patrick L. Kramer, Xiaozhe Shen, Alexander H. Reid, Olaf Borkiewicz, Uta Ruett, Xiaoyi Zhang, Aaron M. Lindenberg, Jihong Ma, Richard Schaller, Dmitri V. Talapin , et al. (1 additional authors not shown)

    Abstract: Symmetry control is essential for realizing unconventional properties, such as ferroelectricity, nonlinear optical responses, and complex topological order, thus it holds promise for the design of emerging quantum and photonic systems. Nevertheless, fast and reversible control of symmetry in materials remains a challenge, especially for nanoscale systems. Here, we unveil reversible symmetry change… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 19 pages, 5 figures

  30. arXiv:2408.14674  [pdf, other

    cs.CV

    gWaveNet: Classification of Gravity Waves from Noisy Satellite Data using Custom Kernel Integrated Deep Learning Method

    Authors: Seraj Al Mahmud Mostafa, Omar Faruque, Chenxi Wang, Jia Yue, Sanjay Purushotham, Jianwu Wang

    Abstract: Atmospheric gravity waves occur in the Earths atmosphere caused by an interplay between gravity and buoyancy forces. These waves have profound impacts on various aspects of the atmosphere, including the patterns of precipitation, cloud formation, ozone distribution, aerosols, and pollutant dispersion. Therefore, understanding gravity waves is essential to comprehend and monitor changes in a wide r… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted at the 27th International Conference on Pattern Recognition (ICPR) 2024

  31. arXiv:2408.09241  [pdf, other

    cs.CV eess.IV

    Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration

    Authors: Xin Lin, Yuyan Zhou, Jingtong Yue, Chao Ren, Kelvin C. K. Chan, Lu Qi, Ming-Hsuan Yang

    Abstract: Unsupervised restoration approaches based on generative adversarial networks (GANs) offer a promising solution without requiring paired datasets. Yet, these GAN-based approaches struggle to surpass the performance of conventional unsupervised GAN-based frameworks without significantly modifying model structures or increasing the computational complexity. To address these issues, we propose a self-… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: This paper is an extended and revised version of our previous work "Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches"(https://openaccess.thecvf.com/content/ICCV2023/papers/Lin_Unsupervised_Image_Denoising_in_Real-World_Scenarios_via_Self-Collaboration_Parallel_Generative_ICCV_2023_paper.pdf)

  32. arXiv:2408.07056  [pdf, ps, other

    math.CO

    A short note on spanning even trees

    Authors: Jiangdong Ai, Zhipeng Gao, Xiangzhou Liu, Jun Yue

    Abstract: We call a tree $T$ is \emph{even} if every pair of its leaves is joined by a path of even length. Jackson and Yoshimoto~[J. Graph Theory, 2024] conjectured that every $r$-regular nonbipartite connected graph $G$ has a spanning even tree. They verified this conjecture for the case when $G$ has a $2$-factor. In this paper, we prove that the conjecture holds when $r$ is odd, thereby resolving the onl… ▽ More

    Submitted 10 September, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 6 pages

  33. arXiv:2408.05802  [pdf, other

    cs.CV

    Egocentric Vision Language Planning

    Authors: Zhirui Fang, Ming Yang, Weishuai Zeng, Boyu Li, Junpeng Yue, Ziluo Ding, Xiu Li, Zongqing Lu

    Abstract: We explore leveraging large multi-modal models (LMMs) and text2image models to build a more general embodied agent. LMMs excel in planning long-horizon tasks over symbolic abstractions but struggle with grounding in the physical world, often failing to accurately identify object positions in images. A bridge is needed to connect LMMs to the physical world. The paper proposes a novel approach, egoc… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  34. arXiv:2407.04206  [pdf, other

    math.NA cs.CE

    Computational Graph Representation of Equations System Constructors in Hierarchical Circuit Simulation

    Authors: Zichao Long, Lin Li, Lei Han, Xianglong Meng, Chongjun Ding, Ruiyan Li, Wu Jiang, Fuchen Ding, Jiaqing Yue, Zhichao Li, Yisheng Hu, Ding Li, Heng Liao

    Abstract: Equations system constructors of hierarchical circuits play a central role in device modeling, nonlinear equations solving, and circuit design automation. However, existing constructors present limitations in applications to different extents. For example, the costs of developing and reusing device models -- especially coarse-grained equivalent models of circuit modules -- remain high while parame… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  35. arXiv:2406.19540  [pdf, other

    cs.CV

    Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results

    Authors: Jialin Yue, Tianyuan Yao, Ruining Deng, Quan Liu, Juming Xiong, Junlin Guo, Haichun Yang, Yuankai Huo

    Abstract: Recently, the use of circle representation has emerged as a method to improve the identification of spherical objects (such as glomeruli, cells, and nuclei) in medical imaging studies. In traditional bounding box-based object detection, combining results from multiple models improves accuracy, especially when real-time processing isn't crucial. Unfortunately, this widely adopted strategy is not re… ▽ More

    Submitted 27 November, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  36. arXiv:2405.20774  [pdf, other

    cs.CR cs.AI

    Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems

    Authors: Ruochen Jiao, Shaoyuan Xie, Justin Yue, Takami Sato, Lixu Wang, Yixuan Wang, Qi Alfred Chen, Qi Zhu

    Abstract: Large Language Models (LLMs) have shown significant promise in real-world decision-making tasks for embodied artificial intelligence, especially when fine-tuned to leverage their inherent common sense and reasoning abilities while being tailored to specific applications. However, this fine-tuning process introduces considerable safety and security vulnerabilities, especially in safety-critical cyb… ▽ More

    Submitted 30 April, 2025; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted paper at ICLR 2025, 31 pages, including main paper, references, and appendix

  37. arXiv:2405.19801  [pdf, other

    physics.space-ph

    Modeling of Nitric Oxide Infrared radiative flux in lower thermosphere: a machine learning perspective

    Authors: Dayakrishna Nailwal, MV Sunil Krishna, Alok Kumar Ranjan, Jia Yue

    Abstract: Nitric Oxide (NO) significantly impacts energy distribution and chemical processes in the mesosphere and lower thermosphere (MLT). During geomagnetic storms, a substantial influx of energy in the thermosphere leads to an increase in NO infrared emissions. Accurately predicting the radiative flux of Nitric Oxide is crucial for understanding the thermospheric energy budget, particularly during extre… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 18 pages, 7 figures

    Journal ref: Under review in Advances in Space Research 2024

  38. arXiv:2405.13199  [pdf, ps, other

    eess.IV cs.CV

    TauAD: MRI-free Tau Anomaly Detection in PET Imaging via Conditioned Diffusion Models

    Authors: Lujia Zhong, Shuo Huang, Jiaxin Yue, Jianwei Zhang, Zhiwei Deng, Wenhao Chi, Yonggang Shi

    Abstract: The emergence of tau PET imaging over the last decade has enabled Alzheimer's disease (AD) researchers to examine tau pathology in vivo and more effectively characterize the disease trajectories of AD. Current tau PET analysis methods, however, typically perform inferences on large cortical ROIs and are limited in the detection of localized tau pathology that varies across subjects. Furthermore, a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  39. arXiv:2405.12806  [pdf, other

    cs.CV

    MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video

    Authors: Hongsheng Wang, Xiang Cai, Xi Sun, Jinhong Yue, Zhanyun Tang, Shengyu Zhang, Feng Lin, Fei Wu

    Abstract: Single-view clothed human reconstruction holds a central position in virtual reality applications, especially in contexts involving intricate human motions. It presents notable challenges in achieving realistic clothing deformation. Current methodologies often overlook the influence of motion on surface deformation, resulting in surfaces lacking the constraints imposed by global motion. To overcom… ▽ More

    Submitted 21 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:1710.03746 by other authors

  40. arXiv:2405.07411  [pdf, other

    cs.CV cs.AI

    MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks

    Authors: Haijiang Tian, Jingkun Yue, Xiaohong Liu, Guoxing Yang, Zeyu Jiang, Guangyu Wang

    Abstract: Medical images are often more difficult to acquire than natural images due to the specialism of the equipment and technology, which leads to less medical image datasets. So it is hard to train a strong pretrained medical vision model. How to make the best of natural pretrained vision model and adapt in medical domain still pends. For image classification, a popular method is linear probe (LP). How… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  41. Hierarchical Characterization of Thermoelectric Performance in Copper-Based Chalcogenide CsCu$_3$S$_2$: Unveiling the role of Anharmonic Lattice Dynamics

    Authors: Jincheng Yue, Jiongzhi Zheng, Junda Li, Xingchen Shen, Wenling Ren, Yanhui Liu, Tian Cui

    Abstract: We explicitly consider both phonon energy shifts and broadening arising from both cubic and quartic anharmonicities, as well as diagonal/non-diagonal terms of heat flux operators in thermal conductivity. Our findings show that the strong anharmonicity of CsCu$_3$S$_2$ primarily arises from the presence of $p$-$d$ anti-bonding hybridization between Cu and S atoms, coupled with the random oscillatio… ▽ More

    Submitted 6 September, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  42. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  43. Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives

    Authors: Yidan Liu, Jun Yue, Shaobo Xia, Pedram Ghamisi, Weiying Xie, Leyuan Fang

    Abstract: As a newly emerging advance in deep generative models, diffusion models have achieved state-of-the-art results in many fields, including computer vision, natural language processing, and molecule design. The remote sensing (RS) community has also noticed the powerful ability of diffusion models and quickly applied them to a variety of tasks for image processing. Given the rapid increase in researc… ▽ More

    Submitted 11 November, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Journal ref: in IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1-22, 2024, Art no. 4708322

  44. arXiv:2404.08215  [pdf, other

    cond-mat.mes-hall

    Stability and noncentered PT symmetry of real topological phases

    Authors: S. J. Yue, Qing Liu, Shengyuan A. Yang, Y. X. Zhao

    Abstract: Real topological phases protected by the spacetime inversion (P T) symmetry are a current research focus. The basis is that the P T symmetry endows a real structure in momentum space, which leads to Z2 topological classifications in 1D and 2D. Here, we provide solutions to two outstanding problems in the diagnosis of real topology. First, based on the stable equivalence in K-theory, we clarify tha… ▽ More

    Submitted 16 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  45. arXiv:2403.20276  [pdf, other

    hep-ex hep-ph physics.ins-det

    Constraints on the Blazar-Boosted Dark Matter from the CDEX-10 Experiment

    Authors: R. Xu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (59 additional authors not shown)

    Abstract: We report new constraints on light dark matter (DM) boosted by blazars using the 205.4 kg day data from the CDEX-10 experiment located at the China Jinping Underground Laboratory. Two representative blazars, TXS 0506+56 and BL Lacertae are studied. The results derived from TXS 0506+56 exclude DM-nucleon elastic scattering cross sections from $4.6\times 10^{-33}\ \rm cm^2$ to… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

  46. arXiv:2403.20263  [pdf, other

    hep-ex hep-ph physics.ins-det

    Probing Dark Matter Particles from Evaporating Primordial Black Holes via Electron Scattering in the CDEX-10 Experiment

    Authors: Z. H. Zhang, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (59 additional authors not shown)

    Abstract: Dark matter (DM) is a major constituent of the Universe. However, no definite evidence of DM particles (denoted as ``$χ$") has been found in DM direct detection (DD) experiments to date. There is a novel concept of detecting $χ$ from evaporating primordial black holes (PBHs). We search for $χ$ emitted from PBHs by investigating their interaction with target electrons. The examined PBH masses range… ▽ More

    Submitted 22 September, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures, 3 tables. Version updated to match SCPMA version

    Journal ref: Sci. China Phys. Mech. Astron. 67, 101011 (2024)

  47. arXiv:2403.15891  [pdf, other

    cs.CV

    Human Motion Prediction under Unexpected Perturbation

    Authors: Jiangbei Yue, Baiyi Li, Julien Pettré, Armin Seyfried, He Wang

    Abstract: We investigate a new task in human motion prediction, which is predicting motions under unexpected physical perturbation potentially involving multiple people. Compared with existing research, this task involves predicting less controlled, unpremeditated and pure reactive motions in response to external impact and how such motions can propagate through people. It brings new challenges such as data… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  48. arXiv:2403.14362  [pdf, other

    cs.CV

    Enabling Generalized Zero-shot Learning Towards Unseen Domains by Intrinsic Learning from Redundant LLM Semantics

    Authors: Jiaqi Yue, Chunhui Zhao, Jiancheng Zhao, Biao Huang

    Abstract: Generalized zero-shot learning (GZSL) focuses on recognizing seen and unseen classes against domain shift problem where data of unseen classes may be misclassified as seen classes. However, existing GZSL is still limited to seen domains. In the current work, we study cross-domain GZSL (CDGZSL) which addresses GZSL towards unseen domains. Different from existing GZSL methods, CDGZSL constructs a co… ▽ More

    Submitted 10 March, 2025; v1 submitted 21 March, 2024; originally announced March 2024.

  49. arXiv:2403.13845  [pdf, other

    cs.LG cs.AI

    Learning to better see the unseen: Broad-Deep Mixed Anti-Forgetting Framework for Incremental Zero-Shot Fault Diagnosis

    Authors: Jiancheng Zhao, Jiaqi Yue, Chunhui Zhao

    Abstract: Zero-shot fault diagnosis (ZSFD) is capable of identifying unseen faults via predicting fault attributes labeled by human experts. We first recognize the demand of ZSFD to deal with continuous changes in industrial processes, i.e., the model's ability to adapt to new fault categories and attributes while avoiding forgetting the diagnosis ability learned previously. To overcome the issue that the e… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  50. arXiv:2403.03186  [pdf, other

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson , et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.