-
Real-World Remote Sensing Image Dehazing: Benchmark and Baseline
Authors:
Zeng-Hui Zhu,
Wei Lu,
Si-Bao Chen,
Chris H. Q. Ding,
Jin Tang,
Bin Luo
Abstract:
Remote Sensing Image Dehazing (RSID) poses significant challenges in real-world scenarios due to the complex atmospheric conditions and severe color distortions that degrade image quality. The scarcity of real-world remote sensing hazy image pairs has compelled existing methods to rely primarily on synthetic datasets. However, these methods struggle with real-world applications due to the inherent domain gap between synthetic and real data. To address this, we introduce Real-World Remote Sensing Hazy Image Dataset (RRSHID), the first large-scale dataset featuring real-world hazy and dehazed image pairs across diverse atmospheric conditions. Based on this, we propose MCAF-Net, a novel framework tailored for real-world RSID. Its effectiveness arises from three innovative components: Multi-branch Feature Integration Block Aggregator (MFIBA), which enables robust feature extraction through cascaded integration blocks and parallel multi-branch processing; Color-Calibrated Self-Supervised Attention Module (CSAM), which mitigates complex color distortions via self-supervised learning and attention-guided refinement; and Multi-Scale Feature Adaptive Fusion Module (MFAFM), which integrates features effectively while preserving local details and global context. Extensive experiments validate that MCAF-Net demonstrates state-of-the-art performance in real-world RSID, while maintaining competitive performance on synthetic datasets. The introduction of RRSHID and MCAF-Net sets new benchmarks for real-world RSID research, advancing practical solutions for this complex task. The code and dataset are publicly available at https://github.com/lwCVer/RRSHID.
Submitted 27 June, 2025; v1 submitted 23 March, 2025;
originally announced March 2025.
-
qGDP: Quantum Legalization and Detailed Placement for Superconducting Quantum Computers
Authors:
Junyao Zhang,
Guanglei Zhou,
Feng Cheng,
Jonathan Ku,
Qi Ding,
Jiaqi Gu,
Hanrui Wang,
Hai "Helen" Li,
Yiran Chen
Abstract:
Noisy Intermediate-Scale Quantum (NISQ) computers are currently limited by their qubit numbers, which hampers progress towards fault-tolerant quantum computing. A major challenge in scaling these systems is crosstalk, which arises from unwanted interactions among neighboring components such as qubits and resonators. An innovative placement strategy tailored for superconducting quantum computers can systematically address crosstalk within the constraints of limited substrate areas.
Legalization is a crucial stage in the placement process, refining post-global-placement configurations to satisfy design constraints and enhance layout quality. However, existing legalizers do not support quantum placements. We aim to address this gap with qGDP, developed to meticulously legalize quantum components by adhering to quantum spatial constraints and reducing resonator crossing to alleviate various crosstalk effects.
Our results indicate that qGDP effectively legalizes and fine-tunes the layout, addressing the quantum-specific spatial constraints inherent in various device topologies. Across diverse NISQ benchmarks, qGDP consistently outperforms state-of-the-art legalization engines, delivering substantial improvements in fidelity and reducing spatial violations, with average gains of 34.4x and 16.9x, respectively.
Submitted 2 November, 2024;
originally announced November 2024.
-
Bi-modality Images Transfer with a Discrete Process Matching Method
Authors:
Zhe Xiong,
Qiaoqiao Ding,
Xiaoqun Zhang
Abstract:
Medical image synthesis has recently gained popularity with the rapid development of generative models. It aims to generate an unacquired image modality, often from other observed data modalities. Synthesized images can be used for clinical diagnostic assistance, data augmentation for model training and validation, or image quality improvement. Meanwhile, flow-based models are among the successful generative models because of their ability to generate realistic, high-quality synthetic images. However, most flow-based models must compute the evolution steps of a flow ordinary differential equation (ODE) during the transfer process, so their performance is significantly limited by heavy computation time due to the large number of time iterations. In this paper, we propose a novel flow-based model, Discrete Process Matching (DPM), for bi-modality image transfer tasks. Unlike other flow-matching-based models, we propose to utilize both forward and backward ODE flows and to enhance consistency on the intermediate images at a few discrete time steps, resulting in a transfer process with far fewer iteration steps while maintaining high-quality generation for both modalities. Our experiments on three datasets of MRI T1/T2 and CT/MRI demonstrate that DPM outperforms other state-of-the-art flow-based methods for bi-modality image synthesis, achieving higher image quality at lower computational cost.
Submitted 23 September, 2024; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Multi-Scale Frequency-Enhanced Deep D-bar Method for Electrical Impedance Tomography
Authors:
Xiang Cao,
Qiaoqiao Ding,
Xiaoqun Zhang
Abstract:
The regularized D-bar method is a popular method for solving Electrical Impedance Tomography (EIT) problems due to its efficiency and simplicity. It utilizes the low-pass truncated scattering data in the non-linear Fourier domain to solve the associated D-bar integral equations, yielding a smooth conductivity approximation. However, the D-bar reconstruction often presents low contrast and resolution due to the absence of accurate high-frequency information and the ill-posedness of the problem. In this paper, we propose a deep learning-based supervised approach for real-time EIT reconstruction. Based on the D-bar method, we propose to utilize both multi-scale frequency enhancement and spatial consistency for high-quality image reconstruction. Additionally, we propose a fixed-point iteration for solving discrete D-bar systems on GPUs for fast computation. Numerical experiments on both continuum model and complete electrode model simulations, using the KIT4 and ACT4 datasets, demonstrate notable improvements in absolute EIT imaging quality.
Submitted 7 February, 2025; v1 submitted 12 May, 2024;
originally announced July 2024.
-
Refined Motion Compensation with Soft Laser Manipulators using Data-Driven Surrogate Models
Authors:
Yongjun Yan,
Qingpeng Ding,
Mingwu Li,
Junyan Yan,
Shing Shin Cheng
Abstract:
Non-contact laser ablation, a precise thermal technique, simultaneously cuts and coagulates tissue without the insertion errors associated with rigid needles. Human organ motions, such as those in the liver, exhibit rhythmic components influenced by respiratory and cardiac cycles, making effective laser energy delivery to target lesions while compensating for tumor motion crucial. This research introduces a data-driven method to derive surrogate models of a soft manipulator. These low-dimensional models offer computational efficiency when integrated into the Model Predictive Control (MPC) framework, while still capturing the manipulator's dynamics with and without control input. Spectral Submanifolds (SSM) theory models the manipulator's autonomous dynamics, acknowledging its tendency to reach equilibrium when external forces are removed. Preliminary results show that the MPC controller using the surrogate model outperforms two other models within the same MPC framework. The data-driven MPC controller also supports a design-agnostic feature, allowing the interchangeability of different soft manipulators within the laser ablation surgery robot system.
Submitted 18 January, 2025; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Quaternion-Based Attitude Stabilization Using Synergistic Hybrid Feedback With Minimal Potential Functions
Authors:
Xin Tong,
Qingpeng Ding,
Haiyang Fang,
Shing Shin Cheng
Abstract:
This paper investigates the robust global attitude stabilization problem for a rigid-body system using quaternion-based feedback. We propose a novel synergistic hybrid feedback with the following notable features: (1) It demonstrates central synergism by utilizing a minimal number of potential functions; (2) It ensures consistency with respect to the unit quaternion representation of rigid-body attitude; (3) Its state-feedback laws incorporate a shared action term that steers the system toward the desired attitude. We demonstrate that the proposed hybrid feedback method effectively solves the problem at hand and guarantees robust uniform global asymptotic stability.
Submitted 12 April, 2024;
originally announced April 2024.
-
Intelligent Reflecting Surfaces vs. Full-Duplex Relays: A Comparison in the Air
Authors:
Qian Ding,
Jie Yang,
Yang Luo,
Chunbo Luo
Abstract:
This letter provides a fundamental analytical comparison of the two major types of relaying methods, intelligent reflecting surfaces (IRSs) and full-duplex relays, focusing on unmanned aerial vehicle (UAV) communication scenarios. Both amplify-and-forward (AF) and decode-and-forward (DF) relaying schemes are included in the comparison. In addition, the optimal 3D UAV deployment and the minimum transmit power under the quality of service constraint are derived. Our numerical results show that IRSs of medium size exhibit comparable performance to AF relays, while outperforming DF relays under extremely large surface sizes and high data rates.
Submitted 14 March, 2024;
originally announced March 2024.
-
Towards the THz Networks in the 6G Era
Authors:
Qian Ding,
Jie Yang,
Yang Luo,
Chunbo Luo
Abstract:
This commentary envisions the role that THz will play in the coming human-centric 6G era. Three distinct THz network types, namely outdoor, indoor, and body area networks, are discussed, with an emphasis on their capabilities in human body detection. Synthesizing these networks will unlock a range of fascinating applications across industrial, biomedical, and entertainment fields, significantly enhancing the quality of human life.
Submitted 13 March, 2024;
originally announced March 2024.
-
Qplacer: Frequency-Aware Component Placement for Superconducting Quantum Computers
Authors:
Junyao Zhang,
Hanrui Wang,
Qi Ding,
Jiaqi Gu,
Reouven Assouly,
William D. Oliver,
Song Han,
Kenneth R. Brown,
Hai "Helen" Li,
Yiran Chen
Abstract:
Noisy Intermediate-Scale Quantum (NISQ) computers face a critical limitation in qubit numbers, hindering their progression towards large-scale and fault-tolerant quantum computing. A significant challenge impeding scaling is crosstalk, characterized by unwanted interactions among neighboring components on quantum chips, including qubits, resonators, and substrate. We motivate a general approach to systematically resolving multifaceted crosstalk in a limited substrate area. We propose Qplacer, a frequency-aware electrostatic-based placement framework tailored for superconducting quantum computers, to alleviate crosstalk by isolating these components in spatial and frequency domains alongside compact substrate design. Qplacer commences with a frequency assigner that ensures frequency domain isolation for qubits and resonators. It then incorporates a padding strategy and resonator partitioning for layout flexibility. Central to our approach is the conceptualization of quantum components as charged particles, enabling strategic spatial isolation through a 'frequency repulsive force' concept. Our results demonstrate that Qplacer carefully crafts the physical component layout to mitigate various crosstalk impacts while maintaining a compact substrate size. On various device topologies and NISQ benchmarks, Qplacer improves fidelity by an average of 36.7x and reduces spatial violations (susceptible to crosstalk) by an average of 12.76x, compared to classical placement engines. Regarding area optimization, compared to manual designs, Qplacer can reduce the required layout area by 2.14x on average.
Submitted 2 November, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning
Authors:
Bairong Deng,
Tao Yu,
Zhenning Pan,
Xuehan Zhang,
Yufeng Wu,
Qiaoyi Ding
Abstract:
Reinforcement learning is an emerging approach to multi-stage sequential decision-making problems. This paper studies real-time multi-stage stochastic power dispatch considering multivariate uncertainties. Current research suffers from low generalization and practicality: the learned dispatch policy can handle only a specific dispatch scenario, and its performance degrades significantly if actual samples and training samples are inconsistent. To fill these gaps, a novel contextual meta graph reinforcement learning (Meta-GRL) approach for a highly generalized multi-stage optimal dispatch policy is proposed. Specifically, a more general contextual Markov decision process (MDP) and a scalable graph representation are introduced to achieve more generalized multi-stage stochastic power dispatch modeling. An upper meta-learner is proposed to encode context for different dispatch scenarios and learn how to achieve dispatch task identification, while the lower policy learner learns a context-specified dispatch policy. After sufficient offline learning, this approach can rapidly adapt to unseen and undefined scenarios with only a few updates of the hypothesis judgments generated by the meta-learner. Numerical comparisons with state-of-the-art policies and traditional reinforcement learning verify the optimality, efficiency, adaptability, and scalability of the proposed Meta-GRL.
Submitted 19 January, 2024;
originally announced January 2024.
-
A Dataset-free Deep learning Method for Low-Dose CT Image Reconstruction
Authors:
Qiaoqiao Ding,
Hui Ji,
Yuhui Quan,
Xiaoqun Zhang
Abstract:
Low-dose CT (LDCT) imaging has attracted considerable interest because it reduces the object's exposure to X-ray radiation. In recent years, supervised deep learning (DL) has been extensively studied for LDCT image reconstruction, which trains a network over a dataset containing many pairs of normal-dose and low-dose images. However, the challenge of collecting many such pairs in the clinical setup limits the application of such supervised-learning-based methods for LDCT image reconstruction in practice. To address the challenges raised by the collection of a training dataset, this paper proposes an unsupervised deep learning method for LDCT image reconstruction, which does not require any external training data. The proposed method is built on a re-parametrization technique for Bayesian inference via a deep network with random weights, combined with additional total variation (TV) regularization. The experiments show that the proposed method noticeably outperforms existing dataset-free image reconstruction methods on the test data.
Submitted 5 October, 2022; v1 submitted 1 May, 2022;
originally announced May 2022.
-
Operating Characteristics for Binary Hypothesis Testing in Quantum Systems
Authors:
Catherine Medlock,
Alan Oppenheim,
Isaac Chuang,
Qi Ding
Abstract:
Receiver operating characteristics (ROCs) are a well-established representation of the tradeoff between detection and false alarm probabilities in classical binary hypothesis testing. We use classical ROCs as motivation for two types of operating characteristics for binary hypothesis testing in quantum systems -- decision operating characteristics (QDOCs) and measurement operating characteristics (QMOCs). Both are described in the context of a framework we propose that encompasses the typical formulations of binary hypothesis testing in both the classical and quantum scenarios. We interpret Helstrom's well-known result regarding discrimination between two quantum density operators with minimum probability of error in this framework. We also present a generalization of previous results regarding the correspondence between classical Parseval frames and quantum measurements. The derivation naturally leads to a constructive procedure for generating many different measurements besides Helstrom's optimal measurement, some standard and others non-standard, that achieve minimum probability of error.
Submitted 14 December, 2020;
originally announced December 2020.
-
Joint Resource Optimization for IRS-Assisted mmWave MIMO under QoS Constraints
Authors:
Qingfeng Ding,
Xinpeng Gao,
Zexiang Wu
Abstract:
This letter focuses on non-convex joint optimization with dynamic multi-user resources for an intelligent reflecting surface-enhanced mmWave system, where all users are concentrated on the unique cluster beam. First, the objective function of this non-linear problem is converted into a quadratic programming form under the quality of service constraints. Further, a multi-block alternating optimization framework with dynamic power allocation is proposed to obtain the maximum sum-rate, where the relaxed ADMM algorithm is adopted to compute the optimal full-digital precoder and the corresponding passive reflecting matrix is obtained by gradient projection. The numerical results verify that beam optimization should be emphasized at high SINR, but joint dynamic resource allocation can further improve system performance even if the hardware dimensions reach the limit.
Submitted 16 November, 2020;
originally announced November 2020.
-
AHP-Net: adaptive-hyper-parameter deep learning based image reconstruction method for multilevel low-dose CT
Authors:
Qiaoqiao Ding,
Yuesong Nan,
Hao Gao,
Hui Ji
Abstract:
Low-dose CT (LDCT) imaging is desirable in many clinical applications to reduce X-ray radiation dose to patients. Inspired by deep learning (DL), a recent promising direction of model-based iterative reconstruction (MBIR) methods for LDCT is via optimization-unrolling DL-regularized image reconstruction, where a pre-defined image prior is replaced by a learnable data-adaptive prior. However, LDCT is clinically multilevel, since clinical scans have different noise levels that depend on scanning site, patient size, and clinical task. Therefore, this work aims to develop an adaptive-hyper-parameter DL-based image reconstruction method (AHP-Net) that can handle multilevel LDCT of different noise levels. AHP-Net unrolls a half-quadratic splitting scheme with a learnable image prior built on a framelet filter bank, and learns a network that automatically adjusts the hyper-parameters for various noise levels. As a result, AHP-Net provides a single universal training model that can handle multilevel LDCT. Extensive experimental evaluations using clinical scans suggest that AHP-Net outperformed conventional MBIR techniques and state-of-the-art deep-learning-based methods for multilevel LDCT of different noise levels.
Submitted 17 February, 2021; v1 submitted 11 August, 2020;
originally announced August 2020.
-
Low-Dose CT with Deep Learning Regularization via Proximal Forward Backward Splitting
Authors:
Qiaoqiao Ding,
Gaoyu Chen,
Xiaoqun Zhang,
Qiu Huang,
Hui Ji,
Hao Gao
Abstract:
Low-dose X-ray computed tomography (LDCT) is desirable for reduced patient dose. This work develops image reconstruction methods with deep learning (DL) regularization for LDCT. Our methods are based on unrolling the proximal forward-backward splitting (PFBS) framework with data-driven image regularization via deep neural networks. In contrast with PFBS-IR, which utilizes standard data fidelity updates via an iterative reconstruction (IR) method, PFBS-AIR involves preconditioned data fidelity updates that fuse an analytical reconstruction (AR) method and IR in a synergistic way, i.e., fused analytical and iterative reconstruction (AIR). The results suggest that the DL-regularized methods (PFBS-IR and PFBS-AIR) provided better reconstruction quality than conventional methods (AR or IR) and a DL-based postprocessing method (FBPConvNet). In addition, owing to AIR, PFBS-AIR noticeably outperformed PFBS-IR.
Submitted 21 September, 2019;
originally announced September 2019.