Skip to main content

Showing 1–50 of 110 results for author: Lu, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.01608  [pdf, ps, other

    cs.CV eess.IV

    Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference

    Authors: Xu Zhang, Ming Lu, Yan Chen, Zhan Ma

    Abstract: In recent years, compressed domain semantic inference has primarily relied on learned image coding models optimized for mean squared error (MSE). However, MSE-oriented optimization tends to yield latent spaces with limited semantic richness, which hinders effective semantic inference in downstream tasks. Moreover, achieving high performance with these models often requires fine-tuning the entire v… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: International Conference on Multimedia and Expo (ICME), 2025

  2. arXiv:2506.19222  [pdf, ps, other

    eess.IV cs.CV

    Deformable Medical Image Registration with Effective Anatomical Structure Representation and Divide-and-Conquer Network

    Authors: Xinke Ma, Yongsheng Pan, Qingjie Zeng, Mengkang Lu, Bolysbek Murat Yerzhanuly, Bazargul Matkerim, Yong Xia

    Abstract: Effective representation of Regions of Interest (ROI) and independent alignment of these ROIs can significantly enhance the performance of deformable medical image registration (DMIR). However, current learning-based DMIR methods have limitations. Unsupervised techniques disregard ROI representation and proceed directly with aligning pairs of images, while weakly-supervised methods heavily depend… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  3. arXiv:2506.18635  [pdf

    eess.SY physics.app-ph

    Hybrid Single-Pulse and Sawyer-Tower Method for Accurate Transistor Loss Separation in High-Frequency High-Efficiency Power Converters

    Authors: Xiaoyang Tian, Mowei Lu, Florin Udrea, Stephan Goetz

    Abstract: Accurate measurement of transistor parasitic capacitance and its associated energy losses is critical for evaluating device performance, particularly in high-frequency and high-efficiency power conversion systems. This paper proposes a hybrid single-pulse and Sawyer-Tower test method to analyse switching characteristics of field-effect transistors (FET), which not only eliminates overlap losses bu… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 5 pages, 8 figures

  4. arXiv:2505.21838  [pdf, ps, other

    eess.SY cs.AI math.OC nlin.CD

    Nonadaptive Output Regulation of Second-Order Nonlinear Uncertain Systems

    Authors: Maobin Lu, Martin Guay, Telema Harry, Shimin Wang, Jordan Cooper

    Abstract: This paper investigates the robust output regulation problem of second-order nonlinear uncertain systems with an unknown exosystem. Instead of the adaptive control approach, this paper resorts to a robust control methodology to solve the problem and thus avoid the bursting phenomenon. In particular, this paper constructs generic internal models for the steady-state state and input variables of the… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 8 pages, 3 figures

  5. arXiv:2505.08281  [pdf, ps, other

    cs.CV eess.IV

    Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

    Authors: Anle Ke, Xu Zhang, Tong Chen, Ming Lu, Chao Zhou, Jiawen Gu, Zhan Ma

    Abstract: Existing multimodal large model-based image compression frameworks often rely on a fragmented integration of semantic retrieval, latent compression, and generative models, resulting in suboptimal performance in both reconstruction fidelity and coding efficiency. To address these challenges, we propose a residual-guided ultra lowrate image compression named ResULIC, which incorporates residual sign… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Journal ref: ICML 2025

  6. arXiv:2503.21820  [pdf, other

    cs.CV eess.IV

    UFM: Unified Feature Matching Pre-training with Multi-Modal Image Assistants

    Authors: Yide Di, Yun Liao, Hao Zhou, Kaijun Zhu, Qing Duan, Junhui Liu, Mingyu Lu

    Abstract: Image feature matching, a foundational task in computer vision, remains challenging for multimodal image applications, often necessitating intricate training on specific datasets. In this paper, we introduce a Unified Feature Matching pre-trained model (UFM) designed to address feature matching challenges across a wide spectrum of modal images. We present Multimodal Image Assistant (MIA) transform… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 34 pages, 13 figures

  7. Flying in Highly Dynamic Environments with End-to-end Learning Approach

    Authors: Xiyu Fan, Minghao Lu, Bowen Xu, Peng Lu

    Abstract: Obstacle avoidance for unmanned aerial vehicles like quadrotors is a popular research topic. Most existing research focuses only on static environments, and obstacle avoidance in environments with multiple dynamic obstacles remains challenging. This paper proposes a novel deep-reinforcement learning-based approach for the quadrotors to navigate through highly dynamic environments. We propose a lid… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: IEEE Robotics and Automation Letters (2025)

  8. arXiv:2503.07667  [pdf, other

    cs.LG cs.AI cs.CV eess.SP

    CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models

    Authors: Wei Dai, Peilin Chen, Malinda Lu, Daniel Li, Haowen Wei, Hejie Cui, Paul Pu Liang

    Abstract: Recent advances in clinical AI have enabled remarkable progress across many clinical domains. However, existing benchmarks and models are primarily limited to a small set of modalities and tasks, which hinders the development of large-scale multimodal methods that can make holistic assessments of patient health and well-being. To bridge this gap, we introduce Clinical Large-Scale Integrative Multi… ▽ More

    Submitted 20 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

  9. arXiv:2503.06226  [pdf, ps, other

    eess.SY cs.AI cs.MA math.OC

    Optimal Output Feedback Learning Control for Discrete-Time Linear Quadratic Regulation

    Authors: Kedi Xie, Martin Guay, Shimin Wang, Fang Deng, Maobin Lu

    Abstract: This paper studies the linear quadratic regulation (LQR) problem of unknown discrete-time systems via dynamic output feedback learning control. In contrast to the state feedback, the optimality of the dynamic output feedback control for solving the LQR problem requires an implicit condition on the convergence of the state observer. Moreover, due to unknown system matrices and the existence of obse… ▽ More

    Submitted 27 May, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: 16 pages, 5 figures

  10. arXiv:2502.13915  [pdf, other

    eess.SY

    Conveniently Identify Coils in Inductive Power Transfer System Using Machine Learning

    Authors: Yifan Zhao, Mowei Lu, Ting Chen, Heyuan Li, Xiang Gao, Zhenbin Zhang, Minfan Fu, Stefan M. Goetz

    Abstract: High-frequency inductive power transfer (IPT) has garnered significant attention in recent years due to its long transmission distance and high efficiency. The inductance values L and quality factors Q of the transmitting and receiving coils greatly influence the system's operation. Traditional methods involved impedance analyzers or network analyzers for measurement, which required bulky and cost… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: This paper has accepted in 2025 IEEE Applied Power Electronics Conference and Exposition (APEC)

  11. arXiv:2502.13880  [pdf, other

    eess.SY

    Class E/EF Inductive Power Transfer to Achieve Stable Output under Variable Low Coupling

    Authors: Yifan Zhao, Mowei Lu, Heyuan Li, Zhenbin Zhang, Minfan Fu, Stefan M. Goetz

    Abstract: This paper develops an inductive power transfer(IPT)system with stable output power based on a Class E/EF inverter. Load-independent design of Class E/EF inverter has recently attracted widespread interest. However, applying this design to IPT systems has proven challenging when the coupling coefficient is weak. To solve this issue, this paper uses an expanded impedance model and substitutes the s… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: This paper has been accepted in 2025 IEEE Conference on Applied Power Electronics Conference and Exposition (APEC)

  12. arXiv:2502.13395  [pdf

    cs.SD cs.LG eess.AS eess.SP physics.optics

    Unsupervised CP-UNet Framework for Denoising DAS Data with Decay Noise

    Authors: Tianye Huang, Aopeng Li, Xiang Li, Jing Zhang, Sijing Xian, Qi Zhang, Mingkong Lu, Guodong Chen, Liangming Xiong, Xiangyun Hu

    Abstract: Distributed acoustic sensor (DAS) technology leverages optical fiber cables to detect acoustic signals, providing cost-effective and dense monitoring capabilities. It offers several advantages including resistance to extreme conditions, immunity to electromagnetic interference, and accurate detection. However, DAS typically exhibits a lower signal-to-noise ratio (S/N) compared to geophones and is… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 13 pages, 8 figures

  13. arXiv:2502.11729  [pdf, other

    eess.IV

    On Quantizing Neural Representation for Variable-Rate Video Coding

    Authors: Junqi Shi, Zhujia Chen, Hanfei Li, Qi Zhao, Ming Lu, Tong Chen, Zhan Ma

    Abstract: This work introduces NeuroQuant, a novel post-training quantization (PTQ) approach tailored to non-generalized Implicit Neural Representations for variable-rate Video Coding (INR-VC). Unlike existing methods that require extensive weight retraining for each target bitrate, we hypothesize that variable-rate coding can be achieved by adjusting quantization parameters (QPs) of pre-trained weights. Ou… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: to be pulished in ICLR'25

  14. arXiv:2502.04988  [pdf, other

    eess.IV cs.CV

    CMamba: Learned Image Compression with State Space Models

    Authors: Zhuojie Wu, Heming Du, Shuyun Wang, Ming Lu, Haiyang Sun, Yandong Guo, Xin Yu

    Abstract: Learned Image Compression (LIC) has explored various architectures, such as Convolutional Neural Networks (CNNs) and transformers, in modeling image content distributions in order to achieve compression effectiveness. However, achieving high rate-distortion performance while maintaining low computational complexity (\ie, parameters, FLOPs, and latency) remains challenging. In this paper, we propos… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  15. arXiv:2501.11263  [pdf, other

    cs.CV eess.IV

    Towards Loss-Resilient Image Coding for Unstable Satellite Networks

    Authors: Hongwei Sha, Muchen Dong, Quanyou Luo, Ming Lu, Hao Chen, Zhan Ma

    Abstract: Geostationary Earth Orbit (GEO) satellite communication demonstrates significant advantages in emergency short burst data services. However, unstable satellite networks, particularly those with frequent packet loss, present a severe challenge to accurate image transmission. To address it, we propose a loss-resilient image coding approach that leverages end-to-end optimization in learned image comp… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: Accepted as a poster presentation at AAAI 2025

  16. arXiv:2501.08825  [pdf, other

    eess.SP

    A Multi-modal Intelligent Channel Model for 6G Multi-UAV-to-Multi-Vehicle Communications

    Authors: Lu Bai, Mengyuan Lu, Ziwei Huang, Xiang Cheng

    Abstract: In this paper, a novel multi-modal intelligent channel model for sixth-generation (6G) multiple-unmanned aerial vehicle (multi-UAV)-to-multi-vehicle communications is proposed. To thoroughly explore the mapping relationship between the physical environment and the electromagnetic space in the complex multi-UAV-to-multi-vehicle scenario, two new parameters, i.e., terrestrial traffic density (TTD) a… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

  17. arXiv:2411.19666  [pdf, other

    eess.IV cs.AI cs.CV cs.LG stat.AP

    Multimodal Whole Slide Foundation Model for Pathology

    Authors: Tong Ding, Sophia J. Wagner, Andrew H. Song, Richard J. Chen, Ming Y. Lu, Andrew Zhang, Anurag J. Vaidya, Guillaume Jaume, Muhammad Shaban, Ahrong Kim, Drew F. K. Williamson, Bowen Chen, Cristina Almagro-Perez, Paul Doucet, Sharifa Sahai, Chengkuan Chen, Daisuke Komura, Akihiro Kawabe, Shumpei Ishikawa, Georg Gerber, Tingying Peng, Long Phi Le, Faisal Mahmood

    Abstract: The field of computational pathology has been transformed with recent advances in foundation models that encode histopathology region-of-interests (ROIs) into versatile and transferable feature representations via self-supervised learning (SSL). However, translating these advancements to address complex clinical challenges at the patient and slide level remains constrained by limited clinical data… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

    Comments: The code is accessible at https://github.com/mahmoodlab/TITAN

  18. arXiv:2410.23695  [pdf, other

    eess.SP

    Parameterized TDOA: Instantaneous TDOA Estimation and Localization for Mobile Targets in a Time-Division Broadcast Positioning System

    Authors: Chenxin Tu, Xiaowei Cui, Gang Liu, Sihao Zhao, Mingquan Lu

    Abstract: In a time-division broadcast positioning system (TDBPS), localizing mobile targets using classical time difference of arrival (TDOA) methods poses significant challenges. Concurrent TDOA measurements are infeasible because targets receive signals from different anchors and extract their transmission times at different reception times, as well as at varying positions. Traditional TDOA estimation sc… ▽ More

    Submitted 22 March, 2025; v1 submitted 31 October, 2024; originally announced October 2024.

    Comments: This manuscript has been accepted for publication in IEEE Internet of Things Journal. The final version will be available at DOI: 10.1109/JIOT.2025.3554528

  19. arXiv:2410.07277  [pdf, other

    eess.AS cs.AI cs.CL cs.SD

    Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection

    Authors: Yilin Pan, Yanpei Shi, Yijia Zhang, Mingyu Lu

    Abstract: Speech is usually used for constructing an automatic Alzheimer's dementia (AD) detection system, as the acoustic and linguistic abilities show a decline in people living with AD at the early stages. However, speech includes not only AD-related local and global information but also other information unrelated to cognitive status, such as age and gender. In this paper, we propose a speech-based syst… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  20. arXiv:2410.02598  [pdf, other

    eess.IV cs.CV

    High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

    Authors: Ming Lu, Zhihao Duan, Wuyang Cong, Dandan Ding, Fengqing Zhu, Zhan Ma

    Abstract: The enhanced Deep Hierarchical Video Compression-DHVC 2.0-has been introduced. This single-model neural video codec operates across a broad range of bitrates, delivering not only superior compression performance to representative methods but also impressive complexity efficiency, enabling real-time processing with a significantly smaller memory footprint on standard GPUs. These remarkable advancem… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  21. arXiv:2409.19660  [pdf, other

    cs.CV eess.IV

    All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation

    Authors: Xu Zhang, Peiyao Guo, Ming Lu, Zhan Ma

    Abstract: Image coding for multi-task applications, catering to both human perception and machine vision, has been extensively investigated. Existing methods often rely on multiple task-specific encoder-decoder pairs, leading to high overhead of parameter and bitrate usage, or face challenges in multi-objective optimization under a unified representation, failing to achieve both performance and efficiency.… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: NeurIPS 2024

  22. arXiv:2409.01009  [pdf, other

    eess.IV

    Accelerating block-level rate control for learned image compression

    Authors: Muchen Dong, Ming Lu, Zhan Ma

    Abstract: Despite the unprecedented compression efficiency achieved by deep learned image compression (LIC), existing methods usually approximate the desired bitrate by adjusting a single quality factor for a given input image, which may compromise the rate control results. Considering the Rate-Distortion (R - D) characteristics of different spatial content, this work introduces the block-level rate control… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 10 pages, 5 figures

    MSC Class: 68P30 ACM Class: I.4.2

  23. arXiv:2407.21395  [pdf, other

    eess.IV

    HINER: Neural Representation for Hyperspectral Image

    Authors: Junqi Shi, Mingyi Jiang, Ming Lu, Tong Chen, Xun Cao, Zhan Ma

    Abstract: This paper introduces {HINER}, a novel neural representation for compressing HSI and ensuring high-quality downstream tasks on compressed HSI. HINER fully exploits inter-spectral correlations by explicitly encoding of spectral wavelengths and achieves a compact representation of the input HSI sample through joint optimization with a learnable decoder. By additionally incorporating the Content Angl… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: ACM MM24

  24. arXiv:2405.10570  [pdf

    eess.IV cs.AI

    Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI

    Authors: Yirong Zhou, Chengyan Wang, Mengtian Lu, Kunyuan Guo, Zi Wang, Dan Ruan, Rui Guo, Peijun Zhao, Jianhua Wang, Naiming Wu, Jianzhong Lin, Yinyin Chen, Hang Jin, Lianxin Xie, Lilan Wu, Liuhong Zhu, Jianjun Zhou, Congbo Cai, He Wang, Xiaobo Qu

    Abstract: In cardiac Magnetic Resonance Imaging (MRI) analysis, simultaneous myocardial segmentation and T2 quantification are crucial for assessing myocardial pathologies. Existing methods often address these tasks separately, limiting their synergistic potential. To address this, we propose SQNet, a dual-task network integrating Transformer and Convolutional Neural Network (CNN) components. SQNet features… ▽ More

    Submitted 29 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures, 6 tables

  25. arXiv:2404.08285  [pdf

    cs.CV cs.AI eess.SY

    A Survey of Neural Network Robustness Assessment in Image Recognition

    Authors: Jie Wang, Jun Ai, Minyan Lu, Haoran Su, Dan Yu, Yutao Zhang, Junda Zhu, Jingyu Liu

    Abstract: In recent years, there has been significant attention given to the robustness assessment of neural networks. Robustness plays a critical role in ensuring reliable operation of artificial intelligence (AI) systems in complex and uncertain environments. Deep learning's robustness problem is particularly significant, highlighted by the discovery of adversarial attacks on image classification models.… ▽ More

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Corrected typos and grammatical errors in Section 5

  26. arXiv:2402.18862  [pdf, other

    eess.IV

    Towards Backward-Compatible Continual Learning of Image Compression

    Authors: Zhihao Duan, Ming Lu, Justin Yang, Jiangpeng He, Zhan Ma, Fengqing Zhu

    Abstract: This paper explores the possibility of extending the capability of pre-trained neural image compressors (e.g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the original model. We refer to this problem as continual learning of image compression. Our initial findings show that baseline solutions, such as end-to-end fine… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024

  27. arXiv:2402.11164  [pdf

    eess.IV

    TinyLIC-High efficiency lossy image compression method

    Authors: Gaocheng Ma, Yinfeng Chai, Tianhao Jiang, Ming Lu, Tong Chen

    Abstract: Image compression has been the subject of extensive research for several decades, resulting in the development of well-known standards such as JPEG, JPEG2000, and H.264/AVC. However, recent advancements in deep learning have led to the emergence of learned image compression methods that offer significant improvements in coding efficiency compared to traditional codecs. These learned compression te… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  28. arXiv:2401.11615  [pdf, other

    eess.IV

    Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

    Authors: Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

    Abstract: While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image. As seen, CLIC expands the receptive field into the entire image… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  29. arXiv:2401.06148  [pdf, other

    eess.IV cs.AI cs.CV q-bio.QM

    Artificial Intelligence for Digital and Computational Pathology

    Authors: Andrew H. Song, Guillaume Jaume, Drew F. K. Williamson, Ming Y. Lu, Anurag Vaidya, Tiffany R. Miller, Faisal Mahmood

    Abstract: Advances in digitizing tissue slides and the fast-paced progress in artificial intelligence, including deep learning, have boosted the field of computational pathology. This field holds tremendous potential to automate clinical diagnosis, predict patient prognosis and response to therapy, and discover new morphological biomarkers from tissue images. Some of these artificial intelligence-based syst… ▽ More

    Submitted 12 December, 2023; originally announced January 2024.

    Journal ref: Nature Reviews Bioengineering 2023

  30. arXiv:2401.04412  [pdf, other

    eess.IV

    Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation

    Authors: Linshan Wu, Ming Lu, Leyuan Fang

    Abstract: Unsupervised domain adaptive (UDA) image segmentation has recently gained increasing attention, aiming to improve the generalization capability for transferring knowledge from the source domain to the target domain. However, in high spatial resolution remote sensing image (RSI), the same category from different domains (\emph{e.g.}, urban and rural) can appear to be totally different with extremel… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: A paper accepted by TGRS

  31. arXiv:2312.08743  [pdf, other

    cs.RO eess.SY

    FAPP: Fast and Adaptive Perception and Planning for UAVs in Dynamic Cluttered Environments

    Authors: Minghao Lu, Xiyu Fan, Han Chen, Peng Lu

    Abstract: Obstacle avoidance for Unmanned Aerial Vehicles (UAVs) in cluttered environments is significantly challenging. Existing obstacle avoidance for UAVs either focuses on fully static environments or static environments with only a few dynamic objects. In this paper, we take the initiative to consider the obstacle avoidance of UAVs in dynamic cluttered environments in which dynamic objects are the domi… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  32. arXiv:2312.07126  [pdf, other

    eess.IV

    Deep Hierarchical Video Compression

    Authors: Ming Lu, Zhihao Duan, Fengqing Zhu, Zhan Ma

    Abstract: Recently, probabilistic predictive coding that directly models the conditional distribution of latent features across successive frames for temporal redundancy removal has yielded promising results. Existing methods using a single-scale Variational AutoEncoder (VAE) must devise complex networks for conditional probability estimation in latent space, neglecting multiscale characteristics of video f… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  33. arXiv:2312.01361  [pdf, other

    cs.CV cs.LG eess.IV

    MoEC: Mixture of Experts Implicit Neural Compression

    Authors: Jianchen Zhao, Cheng-Ching Tseng, Ming Lu, Ruichuan An, Xiaobao Wei, He Sun, Shanghang Zhang

    Abstract: Emerging Implicit Neural Representation (INR) is a promising data compression technique, which represents the data using the parameters of a Deep Neural Network (DNN). Existing methods manually partition a complex scene into local regions and overfit the INRs into those regions. However, manually designing the partition scheme for a complex scene is very challenging and fails to jointly learn the… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  34. arXiv:2311.16572   

    eess.SY physics.ao-ph physics.soc-ph

    Adapting to climate change: Long-term impact of wind resource changes on China's power system resilience

    Authors: Jiaqi Ruan, Xiangrui Meng, Yifan Zhu, Gaoqi Liang, Xianzhuo Sun, Huayi Wu, Huijuan Xiao, Mengqian Lu, Pin Gao, Jiapeng Li, Wai-Kin Wong, Zhao Xu, Junhua Zhao

    Abstract: Modern society's reliance on power systems is at risk from the escalating effects of wind-related climate change. Yet, failure to identify the intricate relationship between wind-related climate risks and power systems could lead to serious short- and long-term issues, including partial or complete blackouts. Here, we develop a comprehensive framework to assess China's power system resilience acro… ▽ More

    Submitted 24 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Not suitable for publication

  35. arXiv:2311.16565  [pdf, other

    cs.CV cs.SD eess.AS

    DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser

    Authors: Peng Chen, Xiaobao Wei, Ming Lu, Yitong Zhu, Naiming Yao, Xingyu Xiao, Hui Chen

    Abstract: Speech-driven 3D facial animation has been an attractive task in both academia and industry. Traditional methods mostly focus on learning a deterministic mapping from speech to animation. Recent approaches start to consider the non-deterministic fact of speech-driven 3D face animation and employ the diffusion model for the task. However, personalizing facial animation and accelerating animation ge… ▽ More

    Submitted 2 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

  36. arXiv:2311.02035  [pdf, other

    eess.SY

    A Highly-Compact Direct-Injection Universal Power Flow and Quality Control Circuit

    Authors: Mowei Lu, Mengjie Qin, Jan Kacetl, Eeshta Suresh, Teng Long, Stefan M. Goetz

    Abstract: This paper presents a novel direct-injection modular universal power flow and quality control topology exclusively using lower power components. In addition to conventional high-voltage applications, it is particularly attractive for the distribution and secondary grids, e.g., in soft open points, down to low voltage as it can exploit the latest developments in low-voltage high-current semiconduct… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  37. arXiv:2310.08292  [pdf, other

    eess.SP cs.AI

    Concealed Electronic Countermeasures of Radar Signal with Adversarial Examples

    Authors: Ruinan Ma, Canjie Zhu, Mingfeng Lu, Yunjie Li, Yu-an Tan, Ruibin Zhang, Ran Tao

    Abstract: Electronic countermeasures involving radar signals are an important aspect of modern warfare. Traditional electronic countermeasures techniques typically add large-scale interference signals to ensure interference effects, which can lead to attacks being too obvious. In recent years, AI-based attack methods have emerged that can effectively solve this problem, but the attack scenarios are currentl… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  38. arXiv:2310.08068  [pdf, other

    eess.IV cs.CV

    Frequency-Aware Re-Parameterization for Over-Fitting Based Image Compression

    Authors: Yun Ye, Yanjie Pan, Qually Jiang, Ming Lu, Xiaoran Fang, Beryl Xu

    Abstract: Over-fitting-based image compression requires weights compactness for compression and fast convergence for practical use, posing challenges for deep convolutional neural networks (CNNs) based methods. This paper presents a simple re-parameterization method to train CNNs with reduced weights storage and accelerated convergence. The convolution kernels are re-parameterized as a weighted sum of discr… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: to be published at ICIP 2023, this version fixed a mistake in Eq. (1) in the proceeding version

  39. arXiv:2310.06162  [pdf

    eess.IV

    Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation

    Authors: Mohammad Peivandi, Jason Zhang, Michael Lu, Dongxiao Zhu, Zhifeng Kou

    Abstract: Brain tumor segmentation presents a formidable challenge in the field of Medical Image Segmentation. While deep-learning models have been useful, human expert segmentation remains the most accurate method. The recently released Segment Anything Model (SAM) has opened up the opportunity to apply foundation models to this difficult task. However, SAM was primarily trained on diverse natural images.… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  40. arXiv:2309.06421  [pdf, other

    eess.IV cs.CV

    AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer

    Authors: Tao Ma, Chao Zhang, Min Lu, Lin Luo

    Abstract: Renal pathology, as the gold standard of kidney disease diagnosis, requires doctors to analyze a series of tissue slices stained by H&E staining and special staining like Masson, PASM, and PAS, respectively. These special staining methods are costly, time-consuming, and hard to standardize for wide use especially in primary hospitals. Advances of supervised learning methods have enabled the virtua… ▽ More

    Submitted 17 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: BMVC 2023

  41. arXiv:2308.15144   

    eess.IV

    TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

    Authors: Yun Liao, Yide Di, Hao Zhou, Kaijun Zhu, Mingyu Lu, Yijia Zhang, Qing Duan, Junhui Liu

    Abstract: Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions. The key to solving this problem lies in effectively and accurately integrating global and local information. To achieve this goal, we introduce an innovative local feature matching method called TKwinFormer. Our approach employs a multi-stage matching strategy to o… ▽ More

    Submitted 30 March, 2025; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: After careful reconsideration, we have decided to withdraw the manuscript due to data inconsistencies and issues with methodology. Given these concerns, we believe it would be inappropriate to proceed with the revised version, and we have therefore decided to retract our submission

    ACM Class: I.4.7

  42. arXiv:2306.08955  [pdf, other

    eess.IV cs.CV cs.LG

    A Comparison of Self-Supervised Pretraining Approaches for Predicting Disease Risk from Chest Radiograph Images

    Authors: Yanru Chen, Michael T Lu, Vineet K Raghu

    Abstract: Deep learning is the state-of-the-art for medical imaging tasks, but requires large, labeled datasets. For risk prediction, large datasets are rare since they require both imaging and follow-up (e.g., diagnosis codes). However, the release of publicly available imaging data with diagnostic labels presents an opportunity for self and semi-supervised approaches to improve label efficiency for risk p… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 33 pages, 22 figures, Accepted for publication at MIDL 2023

  43. arXiv:2306.05196  [pdf, other

    eess.IV cs.CV

    Channel prior convolutional attention for medical image segmentation

    Authors: Hejun Huang, Zuguo Chen, Ying Zou, Ming Lu, Chaoyang Chen

    Abstract: Characteristics such as low contrast and significant organ shape variations are often exhibited in medical images. The improvement of segmentation performance in medical imaging is limited by the generally insufficient adaptive capabilities of existing attention mechanisms. An efficient Channel Prior Convolutional Attention (CPCA) method is proposed in this paper, supporting the dynamic distributi… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  44. arXiv:2304.06497  [pdf, other

    cs.CV eess.IV

    A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution

    Authors: Huicheng Pi, Senmao Tian, Ming Lu, Jiaming Liu, Yandong Guo, Shunli Zhang

    Abstract: Super-Resolution (SR) has gained increasing research attention over the past few years. With the development of Deep Neural Networks (DNNs), many super-resolution methods based on DNNs have been proposed. Although most of these methods are aimed at ordinary frames, there are few works on super-resolution of omnidirectional frames. In these works, omnidirectional frames are projected from the 3D sp… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted to ICASSP2023

  45. QARV: Quantization-Aware ResNet VAE for Lossy Image Compression

    Authors: Zhihao Duan, Ming Lu, Jack Ma, Yuning Huang, Zhan Ma, Fengqing Zhu

    Abstract: This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications. We start by reviewing the framework of variational autoencoders (VAEs), a powerful class of generative probabilistic models that has a deep connection to lossy compression. Based on VAEs, we develop a novel scheme for lossy… ▽ More

    Submitted 1 December, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Full version (19 pages, includes appendix) of the paper accepted by IEEE TPAMI

  46. Efficient Visual Computing with Camera RAW Snapshots

    Authors: Zhihao Li, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

    Abstract: Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP). The images can then be used for photography or visual computing tasks in a variety of applications, such as public safety surveillance and autonomous driving. One can argue that since RAW images contain all the captured information, the conversion of RAW to RGB using an ISP… ▽ More

    Submitted 25 January, 2024; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted by T-PAMI 2024. Homepage: https://njuvision.github.io/rho-vision

  47. Efficient Rigid Body Localization based on Euclidean Distance Matrix Completion for AGV Positioning under Harsh Environment

    Authors: Xinyuan An, Xiaowei Cui, Sihao Zhao, Gang Liu, Mingquan Lu

    Abstract: In real-world applications for automatic guided vehicle (AGV) navigation, the positioning system based on the time-of-flight (TOF) measurements between anchors and tags is confronted with the problem of insufficient measurements caused by blockages to radio signals or lasers, etc. Mounting multiple tags at different positions of the AGV to collect more TOFs is a feasible solution to tackle this di… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  48. A priori knowledge-free fast positioning approach for BeiDou receivers

    Authors: Sihao Zhao, Xiaowei Cui, Mingquan Lu

    Abstract: A Global Navigation Satellite System (GNSS) receiver usually needs a sufficient number of full pseudorange measurements to obtain a position solution. However, it is time-consuming to acquire full pseudorange information from only the satellite broadcast signals due to the navigation data features of GNSS. In order to realize fast positioning during a cold or warm start in a GNSS receiver, the exi… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  49. arXiv:2211.02854  [pdf, other

    eess.IV

    Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression

    Authors: Junqi Shi, Ming Lu, Zhan Ma

    Abstract: Quantizing a floating-point neural network to its fixed-point representation is crucial for Learned Image Compression (LIC) because it improves decoding consistency for interoperability and reduces space-time complexity for implementation. Existing solutions often have to retrain the network for model quantization, which is time-consuming and impractical to some extent. This work suggests using Po… ▽ More

    Submitted 8 October, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

  50. arXiv:2210.01438  [pdf, other

    eess.IV cs.CV

    Complementary consistency semi-supervised learning for 3D left atrial image segmentation

    Authors: Hejun Huang, Zuguo Chen, Chaoyang Chen, Ming Lu, Ying Zou

    Abstract: A network based on complementary consistency training, called CC-Net, has been proposed for semi-supervised left atrium image segmentation. CC-Net efficiently utilizes unlabeled data from the perspective of complementary information to address the problem of limited ability of existing semi-supervised segmentation algorithms to extract information from unlabeled data. The complementary symmetric s… ▽ More

    Submitted 4 April, 2023; v1 submitted 4 October, 2022; originally announced October 2022.