Skip to main content

Showing 1–50 of 67 results for author: Duan, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.10813  [pdf, ps, other

    cs.CV eess.IV eess.SP

    Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing

    Authors: Hang Zhang, Xiang Chen, Renjiu Hu, Rongguang Wang, Jinwei Zhang, Min Liu, Yaonan Wang, Gaolei Li, Xinxing Cheng, Jinming Duan

    Abstract: Learning-based deformable image registration (DIR) accelerates alignment by amortizing traditional optimization via neural networks. Label supervision further enhances accuracy, enabling efficient and precise nonlinear alignment of unseen scans. However, images with sparse features amid large smooth regions, such as retinal vessels, introduce aperture and large-displacement challenges that unsuper… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Accepted for publication at Information Processing in Medical Imaging (IPMI) 2025

  2. arXiv:2506.04552  [pdf

    eess.SP

    DAS-MAE: A self-supervised pre-training framework for universal and high-performance representation learning of distributed fiber-optic acoustic sensing

    Authors: Junyi Duan, Jiageng Chen, Zuyuan He

    Abstract: Distributed fiber-optic acoustic sensing (DAS) has emerged as a transformative approach for distributed vibration measurement with high spatial resolution and long measurement range while maintaining cost-efficiency. However, the two-dimensional spatial-temporal DAS signals present analytical challenges. The abstract signal morphology lacking intuitive physical correspondence complicates human int… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  3. arXiv:2505.09327  [pdf, ps, other

    math.DS eess.SY

    Adaptive control for multi-scale stochastic dynamical systems with stochastic next generation reservoir computing

    Authors: Jiani Cheng, Ting Gao, Jinqiao Duan

    Abstract: The rapid advancement of neuroscience and machine learning has established data-driven stochastic dynamical system modeling as a powerful tool for understanding and controlling high-dimensional, spatio-temporal processes. We introduce the stochastic next-generation reservoir computing (NG-RC) controller, a framework that integrates the computational efficiency of NG-RC with stochastic analysis to… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 30 pages, 14 figures

  4. arXiv:2503.23179  [pdf, other

    eess.IV cs.CV

    OncoReg: Medical Image Registration for Oncological Challenges

    Authors: Wiebke Heyer, Yannic Elser, Lennart Berkel, Xinrui Song, Xuanang Xu, Pingkun Yan, Xi Jia, Jinming Duan, Zi Li, Tony C. W. Mok, BoWen LI, Christian Staackmann, Christoph Großbröhmer, Lasse Hansen, Alessa Hering, Malte M. Sieren, Mattias P. Heinrich

    Abstract: In modern cancer research, the vast volume of medical data generated is often underutilised due to challenges related to patient privacy. The OncoReg Challenge addresses this issue by enabling researchers to develop and validate image registration methods through a two-phase framework that ensures patient privacy while fostering the development of more generalisable AI models. Phase one involves w… ▽ More

    Submitted 1 April, 2025; v1 submitted 29 March, 2025; originally announced March 2025.

    Comments: 26 pages, 6 figures

  5. arXiv:2503.14386  [pdf, other

    physics.med-ph eess.IV

    A Comprehensive Scatter Correction Model for Micro-Focus Dual-Source Imaging Systems: Combining Ambient, Cross, and Forward Scatter

    Authors: Jianing Sun, Jigang Duan, Guangyin Li, Xu Jiang, Xing Zhao

    Abstract: Compared to single-source imaging systems, dual-source imaging systems equipped with two cross-distributed scanning beams significantly enhance temporal resolution and capture more comprehensive object scanning information. Nevertheless, the interaction between the two scanning beams introduces more complex scatter signals into the acquired projection data. Existing methods typically model these s… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  6. arXiv:2502.04399  [pdf, other

    cs.LG cs.AI eess.SY

    Online Location Planning for AI-Defined Vehicles: Optimizing Joint Tasks of Order Serving and Spatio-Temporal Heterogeneous Model Fine-Tuning

    Authors: Bokeng Zheng, Bo Rao, Tianxiang Zhu, Chee Wei Tan, Jingpu Duan, Zhi Zhou, Xu Chen, Xiaoxi Zhang

    Abstract: Advances in artificial intelligence (AI) including foundation models (FMs), are increasingly transforming human society, with smart city driving the evolution of urban living.Meanwhile, vehicle crowdsensing (VCS) has emerged as a key enabler, leveraging vehicles' mobility and sensor-equipped capabilities. In particular, ride-hailing vehicles can effectively facilitate flexible data collection and… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  7. arXiv:2501.15217  [pdf, other

    cs.LG eess.SY

    Predictive Lagrangian Optimization for Constrained Reinforcement Learning

    Authors: Tianqi Zhang, Puzhen Yuan, Guojian Zhan, Ziyu Lin, Yao Lyu, Zhenzhi Qin, Jingliang Duan, Liping Zhang, Shengbo Eben Li

    Abstract: Constrained optimization is popularly seen in reinforcement learning for addressing complex control tasks. From the perspective of dynamic system, iteratively solving a constrained optimization problem can be framed as the temporal evolution of a feedback control system. Classical constrained optimization methods, such as penalty and Lagrangian approaches, inherently use proportional and integral… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  8. arXiv:2410.16821  [pdf, other

    cs.RO eess.SY

    Guiding Reinforcement Learning with Incomplete System Dynamics

    Authors: Shuyuan Wang, Jingliang Duan, Nathan P. Lawrence, Philip D. Loewen, Michael G. Forbes, R. Bhushan Gopaluni, Lixian Zhang

    Abstract: Model-free reinforcement learning (RL) is inherently a reactive method, operating under the assumption that it starts with no prior knowledge of the system and entirely depends on trial-and-error for learning. This approach faces several challenges, such as poor sample efficiency, generalization, and the need for well-designed reward functions to guide learning effectively. On the other hand, cont… ▽ More

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted to IROS 2024

  9. arXiv:2404.03179  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization

    Authors: Tiantian Geng, Teng Wang, Yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng, Ling shao

    Abstract: Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL). Existing methods over-specialize on each task, overlooking the fact that these instances often occur in the same video to form the complete video content. In this work, we present UniAV, a Unified Audio… ▽ More

    Submitted 11 August, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  10. arXiv:2402.03585  [pdf, other

    cs.CV eess.IV

    Decoder-Only Image Registration

    Authors: Xi Jia, Wenqi Lu, Xinxing Cheng, Jinming Duan

    Abstract: In unsupervised medical image registration, the predominant approaches involve the utilization of a encoder-decoder network architecture, allowing for precise prediction of dense, full-resolution displacement fields from given paired images. Despite its widespread use in the literature, we argue for the necessity of making both the encoder and decoder learnable in such an architecture. For this, w… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  11. arXiv:2310.19022  [pdf, other

    math.OC cs.LG eess.SY

    Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback

    Authors: Jingliang Duan, Jie Li, Xuyang Chen, Kai Zhao, Shengbo Eben Li, Lin Zhao

    Abstract: In recent times, significant advancements have been made in delving into the optimization landscape of policy gradient methods for achieving optimal control in linear time-invariant (LTI) systems. Compared with state-feedback control, output-feedback control is more prevalent since the underlying state of the system may not be fully observed in many practical settings. This paper analyzes the opti… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Journal ref: IEEE Transactions on Cybernetics, 2023

  12. Distributional Soft Actor-Critic with Three Refinements

    Authors: Jingliang Duan, Wenxuan Wang, Liming Xiao, Jiaxin Gao, Shengbo Eben Li, Chang Liu, Ya-Qin Zhang, Bo Cheng, Keqiang Li

    Abstract: Reinforcement learning (RL) has shown remarkable success in solving complex decision-making and control tasks. However, many model-free RL algorithms experience performance degradation due to inaccurate value estimation, particularly the overestimation of Q-values, which can lead to suboptimal policies. To address this issue, we previously proposed the Distributional Soft Actor-Critic (DSAC or DSA… ▽ More

    Submitted 1 February, 2025; v1 submitted 9 October, 2023; originally announced October 2023.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025

  13. VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv , et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Journal ref: The latest VisionFM work has been published in NEJM AI, 2024

  14. arXiv:2307.15273  [pdf, other

    cs.CV cs.LG eess.IV

    Recovering high-quality FODs from a reduced number of diffusion-weighted images using a model-driven deep learning architecture

    Authors: J Bartlett, C E Davey, L A Johnston, J Duan

    Abstract: Fibre orientation distribution (FOD) reconstruction using deep learning has the potential to produce accurate FODs from a reduced number of diffusion-weighted images (DWIs), decreasing total imaging time. Diffusion acquisition invariant representations of the DWI signals are typically used as input to these methods to ensure that they can be applied flexibly to data with different b-vectors and b-… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 10 pages, 7 figures, This work has been submitted to the IEEE for possible publication

    Journal ref: Magn Reson Med.2024;92:2193-2206

  15. arXiv:2307.05382  [pdf, other

    eess.SP cs.AI cs.LG

    Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling

    Authors: Ziyue Li, Yuchen Fang, You Li, Kan Ren, Yansen Wang, Xufang Luo, Juanyong Duan, Congrui Huang, Dongsheng Li, Lili Qiu

    Abstract: A timely detection of seizures for newborn infants with electroencephalogram (EEG) has been a common yet life-saving practice in the Neonatal Intensive Care Unit (NICU). However, it requires great human efforts for real-time monitoring, which calls for automated solutions to neonatal seizure detection. Moreover, the current automated methods focusing on adult epilepsy monitoring often fail due to… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2023

  16. arXiv:2307.02997  [pdf, other

    eess.IV cs.CV

    Fourier-Net+: Leveraging Band-Limited Representation for Efficient 3D Medical Image Registration

    Authors: Xi Jia, Alexander Thorley, Alberto Gomez, Wenqi Lu, Dipak Kotecha, Jinming Duan

    Abstract: U-Net style networks are commonly utilized in unsupervised image registration to predict dense displacement fields, which for high-resolution volumetric image data is a resource-intensive and time-consuming task. To tackle this challenge, we first propose Fourier-Net, which replaces the costly U-Net style expansive path with a parameter-free model-driven decoder. Instead of directly predicting a f… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Under review. arXiv admin note: text overlap with arXiv:2211.16342

  17. arXiv:2305.18355  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization

    Authors: Fei Kong, Jinhao Duan, RuiPeng Ma, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Recently, diffusion models have achieved remarkable success in generating tasks, including image and audio generation. However, like other generative models, diffusion models are prone to privacy issues. In this paper, we propose an efficient query-based membership inference attack (MIA), namely Proximal Initialization Attack (PIA), which utilizes groundtruth trajectory obtained by $ε$ initialized… ▽ More

    Submitted 9 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  18. arXiv:2304.01041  [pdf, other

    cs.RO eess.SY

    Integrated Behavior Planning and Motion Control for Autonomous Vehicles with Traffic Rules Compliance

    Authors: Haichao Liu, Kai Chen, Yulin Li, Zhenmin Huang, Jianghua Duan, Jun Ma

    Abstract: In this article, we propose an optimization-based integrated behavior planning and motion control scheme, which is an interpretable and adaptable urban autonomous driving solution that complies with complex traffic rules while ensuring driving safety. Inherently, to ensure compliance with traffic rules, an innovative design of potential functions (PFs) is presented to characterize various traffic… ▽ More

    Submitted 30 November, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 7 pages, 5 figures, accepted for publication in The 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)

  19. arXiv:2303.12930  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

    Authors: Tiantian Geng, Teng Wang, Jinming Duan, Runmin Cong, Feng Zheng

    Abstract: Existing audio-visual event localization (AVE) handles manually trimmed videos with only a single instance in each of them. However, this setting is unrealistic as natural videos often contain numerous audio-visual events with different categories. To better adapt to real-life applications, in this paper we focus on the task of dense-localizing audio-visual events, which aims to jointly localize a… ▽ More

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  20. arXiv:2210.07553  [pdf, other

    cs.RO cs.LG eess.SY

    Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

    Authors: Dongjie Yu, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Jingliang Duan, Jianyu Chen

    Abstract: Safe reinforcement learning (RL) that solves constraint-satisfactory policies provides a promising way to the broader safety-critical applications of RL in real-world problems such as robotics. Among all safe RL approaches, model-based methods reduce training time violations further due to their high sample efficiency. However, lacking safety robustness against the model uncertainties remains an i… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures

  21. On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator

    Authors: Jingliang Duan, Wenhan Cao, Yang Zheng, Lin Zhao

    Abstract: The convergence of policy gradient algorithms in reinforcement learning hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies… ▽ More

    Submitted 29 October, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.09598

    Journal ref: 2022 IEEE 61st Conference on Decision and Control (CDC)

  22. arXiv:2208.04939  [pdf, other

    eess.IV cs.CV

    U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?

    Authors: Xi Jia, Joseph Bartlett, Tianyang Zhang, Wenqi Lu, Zhaowen Qiu, Jinming Duan

    Abstract: Due to their extreme long-range modeling capability, vision transformer-based networks have become increasingly popular in deformable image registration. We believe, however, that the receptive field of a 5-layer convolutional U-Net is sufficient to capture accurate deformations without needing long-range dependencies. The purpose of this study is therefore to investigate whether U-Net-based metho… ▽ More

    Submitted 13 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI-MLMI 2022

  23. arXiv:2206.02346  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs

    Authors: Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Başar, Mihailo R. Jovanović

    Abstract: We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD)… ▽ More

    Submitted 28 August, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 74 pages, 4 figures, 2 tables

  24. arXiv:2205.12857  [pdf, other

    eess.IV cs.CV

    Structure Unbiased Adversarial Model for Medical Image Segmentation

    Authors: Tianyang Zhang, Shaoming Zheng, Jun Cheng, Xi Jia, Joseph Bartlett, Xinxing Cheng, Huazhu Fu, Zhaowen Qiu, Jiang Liu, Jinming Duan

    Abstract: Generative models have been widely proposed in image recognition to generate more images where the distribution is similar to that of the real ones. It often introduces a discriminator network to differentiate the real data from the generated ones. Such models utilise a discriminator network tasked with differentiating style transferred data from data contained in the target dataset. However in do… ▽ More

    Submitted 30 July, 2024; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Will revise the paper and resubmit

  25. arXiv:2204.04403  [pdf, other

    cs.RO eess.SY

    Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning

    Authors: Yangang Ren, Guojian Zhan, Liye Tang, Shengbo Eben Li, Jianhua Jiang, Jingliang Duan

    Abstract: Intersections are quite challenging among various driving scenes wherein the interaction of signal lights and distinct traffic actors poses great difficulty to learn a wise and robust driving policy. Current research rarely considers the diversity of intersections and stochastic behaviors of traffic participants. For practical applications, the randomness usually leads to some devastating events,… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

  26. arXiv:2204.02857  [pdf, other

    eess.SY

    Primal-dual Estimator Learning: an Offline Constrained Moving Horizon Estimation Method with Feasibility and Near-optimality Guarantees

    Authors: Wenhan Cao, Jingliang Duan, Shengbo Eben Li, Chen Chen, Chang Liu, Yu Wang

    Abstract: This paper proposes a primal-dual framework to learn a stable estimator for linear constrained estimation problems leveraging the moving horizon approach. To avoid the online computational burden in most existing methods, we learn a parameterized function offline to approximate the primal estimate. Meanwhile, a dual estimator is trained to check the suboptimality of the primal estimator during exe… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  27. On the Optimization Landscape of Dynamic Output Feedback Linear Quadratic Control

    Authors: Jingliang Duan, Wenhan Cao, Yang Zheng, Lin Zhao

    Abstract: The convergence of policy gradient algorithms hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies (controllers). We investig… ▽ More

    Submitted 29 October, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: IEEE Transactions on Automatic Control (full paper), 2023

  28. arXiv:2112.09357  [pdf, other

    cs.CV cs.SD eess.AS

    Interpreting Audiograms with Multi-stage Neural Networks

    Authors: Shufan Li, Congxi Lu, Linkai Li, Jirong Duan, Xinping Fu, Haoshuai Zhou

    Abstract: Audiograms are a particular type of line charts representing individuals' hearing level at various frequencies. They are used by audiologists to diagnose hearing loss, and further select and tune appropriate hearing aids for customers. There have been several projects such as Autoaudio that aim to accelerate this process through means of machine learning. But all existing models at their best can… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 12pages,12 figures. The code for this project is available at https://github.com/jacklishufan/MAIN2021

  29. arXiv:2112.04489  [pdf, other

    eess.IV cs.CV

    Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning

    Authors: Alessa Hering, Lasse Hansen, Tony C. W. Mok, Albert C. S. Chung, Hanna Siebert, Stephanie Häger, Annkristin Lange, Sven Kuckertz, Stefan Heldmann, Wei Shao, Sulaiman Vesal, Mirabela Rusu, Geoffrey Sonn, Théo Estienne, Maria Vakalopoulou, Luyi Han, Yunzhi Huang, Pew-Thian Yap, Mikael Brudfors, Yaël Balbastre, Samuel Joutard, Marc Modat, Gal Lifshitz, Dan Raviv, Jinxin Lv , et al. (28 additional authors not shown)

    Abstract: Image registration is a fundamental medical image analysis task, and a wide variety of approaches have been proposed. However, only a few studies have comprehensively compared medical image registration approaches on a wide range of clinically relevant tasks. This limits the development of registration methods, the adoption of research advances into practice, and a fair benchmark across competing… ▽ More

    Submitted 7 October, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  30. Optimization Landscape of Gradient Descent for Discrete-time Static Output Feedback

    Authors: Jingliang Duan, Jie Li, Shengbo Eben Li, Lin Zhao

    Abstract: In this paper, we analyze the optimization landscape of gradient descent methods for static output feedback (SOF) control of discrete-time linear time-invariant systems with quadratic cost. The SOF setting can be quite common, for example, when there are unmodeled hidden states in the underlying process. We first establish several important properties of the SOF cost function, including coercivity… ▽ More

    Submitted 10 March, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Journal ref: 2022 American Control Conference (ACC)

  31. arXiv:2109.05540  [pdf, other

    cs.RO eess.SY

    Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios

    Authors: Jingliang Duan, Yangang Ren, Fawang Zhang, Yang Guan, Dongjie Yu, Shengbo Eben Li, Bo Cheng, Lin Zhao

    Abstract: In this paper, we propose a new reinforcement learning (RL) algorithm, called encoding distributional soft actor-critic (E-DSAC), for decision-making in autonomous driving. Unlike existing RL-based decision-making methods, E-DSAC is suitable for situations where the number of surrounding vehicles is variable and eliminates the requirement for manually pre-designed sorting rules, resulting in highe… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

  32. arXiv:2108.11623  [pdf, other

    cs.LG cs.RO eess.SY

    Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

    Authors: Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun

    Abstract: Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address thes… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

  33. Iterative Self-consistent Parallel Magnetic Resonance Imaging Reconstruction based on Nonlocal Low-Rank Regularization

    Authors: Ting Pan, Jizhong Duan, Junfeng Wang, Yu Liu

    Abstract: Iterative self-consistent parallel imaging reconstruction (SPIRiT) is an effective self-calibrated reconstruction model for parallel magnetic resonance imaging (PMRI). The joint L1 norm of wavelet coefficients and joint total variation (TV) regularization terms are incorporated into the SPIRiT model to improve the reconstruction performance. The simultaneous two-directional low-rankness (STDLR) in… ▽ More

    Submitted 17 April, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Journal ref: Magnetic Resonance Imaging, vol. 88, pp. 62-75, 2022

  34. arXiv:2107.07907  [pdf, other

    eess.IV cs.CV cs.MM

    Lightness Modulated Deep Inverse Tone Mapping

    Authors: Kanglin Liu, Gaofeng Cao, Jiang Duan, Guoping Qiu

    Abstract: Single-image HDR reconstruction or inverse tone mapping (iTM) is a challenging task. In particular, recovering information in over-exposed regions is extremely difficult because details in such regions are almost completely lost. In this paper, we present a deep learning based iTM method that takes advantage of the feature extraction and mapping power of deep convolutional neural networks (CNNs) a… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 10 figures

  35. arXiv:2105.12227  [pdf, other

    cs.CV eess.IV

    Learning a Model-Driven Variational Network for Deformable Image Registration

    Authors: Xi Jia, Alexander Thorley, Wei Chen, Huaqi Qiu, Linlin Shen, Iain B Styles, Hyung Jin Chang, Ales Leonardis, Antonio de Marvao, Declan P. O'Regan, Daniel Rueckert, Jinming Duan

    Abstract: Data-driven deep learning approaches to image registration can be less accurate than conventional iterative approaches, especially when training data is limited. To address this whilst retaining the fast inference speed of deep learning, we propose VR-Net, a novel cascaded variational network for unsupervised deformable image registration. Using the variable splitting optimization scheme, we first… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: This work has been submitted to the IEEE for possible publication

  36. Fixed-Dimensional and Permutation Invariant State Representation of Autonomous Driving

    Authors: Jingliang Duan, Dongjie Yu, Shengbo Eben Li, Wenxuan Wang, Yangang Ren, Ziyu Lin, Bo Cheng

    Abstract: In this paper, we propose a new state representation method, called encoding sum and concatenation (ESC), for the state representation of decision-making in autonomous driving. Unlike existing state representation methods, ESC is applicable to a variable number of surrounding vehicles and eliminates the need for manually pre-designed sorting rules, leading to higher representation ability and gene… ▽ More

    Submitted 4 March, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2021

  37. arXiv:2104.05810  [pdf, other

    cs.MA cs.GT eess.SY

    A Distributed and Resilient Bargaining Game for Weather-Predictive Microgrid Energy Cooperation

    Authors: Lu An, Jie Duan, Mo-Yuen Chow, Alexandra Duel-Hallen

    Abstract: A bargaining game is investigated for cooperative energy management in microgrids. This game incorporates a fully distributed and realistic cooperative power scheduling algorithm (CoDES) as well as a distributed Nash Bargaining Solution (NBS)-based method of allocating the overall power bill resulting from CoDES. A novel weather-based stochastic renewable generation (RG) prediction method is incor… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages, 8 figures, published in IEEE Transactions on Industrial Informatics

    Journal ref: IEEE Transactions on Industrial Informatics 15 (8), 4721-4730, 2019

  38. arXiv:2103.05505  [pdf

    eess.SY cs.LG

    Approximate Optimal Filter for Linear Gaussian Time-invariant Systems

    Authors: Kaiming Tang, Shengbo Eben Li, Yuming Yin, Yang Guan, Jingliang Duan, Wenhan Cao, Jie Li

    Abstract: State estimation is critical to control systems, especially when the states cannot be directly measured. This paper presents an approximate optimal filter, which enables to use policy iteration technique to obtain the steady-state gain in linear Gaussian time-invariant systems. This design transforms the optimal filtering problem with minimum mean square error into an optimal control problem, call… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  39. arXiv:2102.11736  [pdf, other

    eess.SY cs.AI

    Recurrent Model Predictive Control

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Qi Sun, Bo Cheng

    Abstract: This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems. Unlike traditional Model Predictive Control (MPC) algorithms, it can make full use of the current computing resources and adaptively select the longest model prediction horizon. Our algorithm employs a recurrent function to approximate the… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.10289

  40. Recurrent Model Predictive Control: Learning an Explicit Recurrent Controller for Nonlinear Systems

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Bo Cheng

    Abstract: This paper proposes an offline control algorithm, called Recurrent Model Predictive Control (RMPC), to solve large-scale nonlinear finite-horizon optimal control problems. It can be regarded as an explicit solver of traditional Model Predictive Control (MPC) algorithms, which can adaptively select appropriate model prediction horizon according to current computing resources, so as to improve the p… ▽ More

    Submitted 8 April, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

    Journal ref: IEEE Transactions on Industrial Electronics, 2022

  41. arXiv:2102.08539  [pdf, other

    cs.LG cs.AI eess.SY

    Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning

    Authors: Baiyu Peng, Yao Mu, Jingliang Duan, Yang Guan, Shengbo Eben Li, Jianyu Chen

    Abstract: Safety is essential for reinforcement learning (RL) applied in real-world tasks like autonomous driving. Chance constraints which guarantee the satisfaction of state constraints at a high probability are suitable to represent the requirements in real-world environment with uncertainty. Existing chance constrained RL methods like the penalty method and the Lagrangian method either exhibit periodic… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  42. arXiv:2012.11974  [pdf, other

    eess.IV

    Complementary Time-Frequency Domain Networks for Dynamic Parallel MR Image Reconstruction

    Authors: Chen Qin, Jinming Duan, Kerstin Hammernik, Jo Schlemper, Thomas Küstner, René Botnar, Claudia Prieto, Anthony N. Price, Joseph V. Hajnal, Daniel Rueckert

    Abstract: Purpose: To introduce a novel deep learning based approach for fast and high-quality dynamic multi-coil MR reconstruction by learning a complementary time-frequency domain network that exploits spatio-temporal correlations simultaneously from complementary domains. Theory and Methods: Dynamic parallel MR image reconstruction is formulated as a multi-variable minimisation problem, where the data… ▽ More

    Submitted 18 June, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: Accepted by Magnetic Resonance in Medicine

  43. arXiv:2012.06458  [pdf, other

    math.OC eess.SY

    On Training Effective Reinforcement Learning Agents for Real-time Power Grid Operation and Control

    Authors: Ruisheng Diao, Di Shi, Bei Zhang, Siqi Wang, Haifeng Li, Chunlei Xu, Tu Lan, Desong Bian, Jiajun Duan

    Abstract: Deriving fast and effectively coordinated control actions remains a grand challenge affecting the secure and economic operation of today's large-scale power grid. This paper presents a novel artificial intelligence (AI) based methodology to achieve multi-objective real-time power grid control for real-world implementation. State-of-the-art off-policy reinforcement learning (RL) algorithm, soft act… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  44. arXiv:2009.04395  [pdf, other

    cs.LG eess.SP

    Automated Model Selection for Time-Series Anomaly Detection

    Authors: Yuanxiang Ying, Juanyong Duan, Chunlei Wang, Yujing Wang, Congrui Huang, Bixiong Xu

    Abstract: Time-series anomaly detection is a popular topic in both academia and industrial fields. Many companies need to monitor thousands of temporal signals for their applications and services and require instant feedback and alerts for potential incidents in time. The task is challenging because of the complex characteristics of time-series, which are messy, stochastic, and often without proper labels.… ▽ More

    Submitted 25 August, 2020; originally announced September 2020.

  45. arXiv:2007.06810  [pdf

    eess.SY cs.GT cs.LG

    Ternary Policy Iteration Algorithm for Nonlinear Robust Control

    Authors: Jie Li, Shengbo Eben Li, Yang Guan, Jingliang Duan, Wenyu Li, Yuming Yin

    Abstract: The uncertainties in plant dynamics remain a challenge for nonlinear control problems. This paper develops a ternary policy iteration (TPI) algorithm for solving nonlinear robust control problems with bounded uncertainties. The controller and uncertainty of the system are considered as game players, and the robust control problem is formulated as a two-player zero-sum differential game. In order t… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  46. arXiv:2007.05993  [pdf, other

    eess.IV cs.CV

    Deep Network Interpolation for Accelerated Parallel MR Image Reconstruction

    Authors: Chen Qin, Jo Schlemper, Kerstin Hammernik, Jinming Duan, Ronald M Summers, Daniel Rueckert

    Abstract: We present a deep network interpolation strategy for accelerated parallel MR image reconstruction. In particular, we examine the network interpolation in parameter space between a source model that is formulated in an unrolled scheme with L1 and SSIM losses and its counterpart that is trained with an adversarial loss. We show that by interpolating between the two different models of the same netwo… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Presented at 2020 ISMRM Conference & Exhibition (Abstract #4958)

  47. arXiv:2007.02070  [pdf, other

    eess.SY

    Continuous-time finite-horizon ADP for automated vehicle controller design with high efficiency

    Authors: Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Yuming Yin

    Abstract: The design of an automated vehicle controller can be generally formulated into an optimal control problem. This paper proposes a continuous-time finite-horizon approximate dynamicprogramming (ADP) method, which can synthesis off-line near-optimal control policy with analytical vehicle dynamics. Lying on the general Policy Iteration framework, it employs value andpolicy neural networks to approxima… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: 7 pages,conference

  48. Hierarchical Reinforcement Learning for Self-Driving Decision-Making without Reliance on Labeled Driving Data

    Authors: Jingliang Duan, Shengbo Eben Li, Yang Guan, Qi Sun, Bo Cheng

    Abstract: Decision making for self-driving cars is usually tackled by manually encoding rules from drivers' behaviors or imitating drivers' manipulation using supervised learning techniques. Both of them rely on mass driving data to cover all possible driving scenarios. This paper presents a hierarchical reinforcement learning method for decision making of self-driving cars, which does not depend on a large… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Journal ref: IET Intelligent Transport Systems, 2020, 14(5): 297-305

  49. arXiv:2001.02811  [pdf, other

    cs.LG cs.AI eess.SY

    Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors

    Authors: Jingliang Duan, Yang Guan, Shengbo Eben Li, Yangang Ren, Bo Cheng

    Abstract: In reinforcement learning (RL), function approximation errors are known to easily lead to the Q-value overestimations, thus greatly reducing policy performance. This paper presents a distributional soft actor-critic (DSAC) algorithm, which is an off-policy RL method for continuous control setting, to improve the policy performance by mitigating Q-value overestimations. We first discover in theory… ▽ More

    Submitted 11 June, 2021; v1 submitted 8 January, 2020; originally announced January 2020.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2021

  50. arXiv:1912.09278  [pdf, other

    eess.IV cs.CV cs.LG

    $Σ$-net: Systematic Evaluation of Iterative Deep Neural Networks for Fast Parallel MR Image Reconstruction

    Authors: Kerstin Hammernik, Jo Schlemper, Chen Qin, Jinming Duan, Ronald M. Summers, Daniel Rueckert

    Abstract: Purpose: To systematically investigate the influence of various data consistency layers, (semi-)supervised learning and ensembling strategies, defined in a $Σ$-net, for accelerated parallel MR image reconstruction using deep learning. Theory and Methods: MR image reconstruction is formulated as learned unrolled optimization scheme with a Down-Up network as regularization and varying data consist… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Submitted to Magnetic Resonance in Medicine