
Showing 1–13 of 13 results for author: Zhan, S

Searching in archive eess.
  1. arXiv:2412.02931  [pdf, other]

    cs.LG cs.AI eess.SY

    Inverse Delayed Reinforcement Learning

    Authors: Simon Sinong Zhan, Qingyuan Wu, Zhian Ruan, Frank Yang, Philip Wang, Yixuan Wang, Ruochen Jiao, Chao Huang, Qi Zhu

    Abstract: Inverse Reinforcement Learning (IRL) has demonstrated effectiveness in a variety of imitation tasks. In this paper, we introduce an IRL framework designed to extract rewarding features from expert trajectories affected by delayed disturbances. Instead of relying on direct observations, our approach employs an efficient off-policy adversarial training framework to derive expert features and recover…

    Submitted 3 December, 2024; originally announced December 2024.

  2. arXiv:2406.16588  [pdf, other]

    eess.SY cs.FL

    Switching Controller Synthesis for Hybrid Systems Against STL Formulas

    Authors: Han Su, Shenghua Feng, Sinong Zhan, Naijun Zhan

    Abstract: Switching controllers play a pivotal role in directing hybrid systems (HSs) towards the desired objective, embodying a "correct-by-construction" approach to HS design. Identifying these objectives is thus crucial for the synthesis of effective switching controllers. While most existing works focus on safety and liveness, few of them consider timing constraints. In this paper, we delve into t…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2405.02487  [pdf, other]

    eess.SY

    Distributed Online Feedback Optimization for Real-time Distribution System Voltage Regulation

    Authors: Sen Zhan, Nikolaos G. Paterakis, Wouter van den Akker, Anne van der Molen, Johan Morren, J. G. Slootweg

    Abstract: We investigate the real-time voltage regulation problem in distribution systems employing online feedback optimization (OFO) with short-range communication between physical neighbours. OFO needs neither an accurate grid model nor estimated consumption of non-controllable loads, affords fast calculations, and demonstrates robustness to uncertainties and disturbances, which render it particularly su…

    Submitted 11 October, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2403.16132  [pdf, ps, other]

    eess.SY cs.LG

    Runtime Monitoring and Fault Detection for Neural Network-Controlled Systems

    Authors: Jianglin Lan, Siyuan Zhan, Ron Patton, Xianxian Zhao

    Abstract: There is an emerging trend in applying deep learning methods to control complex nonlinear systems. This paper considers enhancing the runtime safety of nonlinear systems controlled by neural networks in the presence of disturbance and measurement noise. A robustly stable interval observer is designed to generate sound and precise lower and upper bounds for the neural network, nonlinear function, a…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to SAFEPROCESS 2024

  5. arXiv:2402.03141  [pdf, other]

    cs.LG cs.AI eess.SY

    Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang

    Abstract: Reinforcement learning (RL) is challenging in the common case of delays between events and their sensory perceptions. State-of-the-art (SOTA) state augmentation techniques either suffer from state space explosion or performance degeneration in stochastic environments. To address these challenges, we present a novel Auxiliary-Delayed Reinforcement Learning (AD-RL) method that leverages auxiliary ta…

    Submitted 5 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2312.00812  [pdf, other]

    cs.AI cs.LG eess.SY

    Empowering Autonomous Driving with Large Language Models: A Safety Perspective

    Authors: Yixuan Wang, Ruochen Jiao, Sinong Simon Zhan, Chengtian Lang, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

    Abstract: Autonomous Driving (AD) encounters significant safety hurdles in long-tail unforeseen driving scenarios, largely stemming from the non-interpretability and poor generalization of the deep neural networks within the AD system, particularly in out-of-distribution and uncertain data. To this end, this paper explores the integration of Large Language Models (LLMs) into AD systems, leveraging their rob…

    Submitted 22 March, 2024; v1 submitted 27 November, 2023; originally announced December 2023.

    Comments: Accepted to LLMAgent workshop @ICLR2024

  7. arXiv:2311.02227  [pdf, other]

    cs.LG cs.AI eess.SY

    State-Wise Safe Reinforcement Learning With Pixel Observations

    Authors: Simon Sinong Zhan, Yixuan Wang, Qingyuan Wu, Ruochen Jiao, Chao Huang, Qi Zhu

    Abstract: In the context of safe exploration, Reinforcement Learning (RL) has long grappled with the challenges of balancing the tradeoff between maximizing rewards and minimizing safety violations, particularly in complex environments with contact-rich or non-smooth dynamics, and when dealing with high-dimensional pixel observations. Furthermore, incorporating state-wise safety constraints in the explorati…

    Submitted 11 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  8. arXiv:2209.15090  [pdf, other]

    eess.SY cs.LG

    Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

    Authors: Yixuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu

    Abstract: It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and stochastic environment under hard constraints that require the system state not to reach certain specified unsafe regions. Many popular safe RL methods such as those based on the Constrained Markov Decision Process (CMDP) paradigm formulate safety violations in a cost function and try to constrain…

    Submitted 13 June, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to ICML 2023

  9. arXiv:2201.12243  [pdf, other]

    cs.LG eess.SY

    Joint Differentiable Optimization and Verification for Certified Reinforcement Learning

    Authors: Yixuan Wang, Simon Zhan, Zhilu Wang, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu

    Abstract: In model-based reinforcement learning for safety-critical control systems, it is important to formally certify system properties (e.g., safety, stability) under the learned controller. However, as existing methods typically apply formal verification after the controller has been learned, it is sometimes difficult to obtain any certificate, even after many iterations between learning and ver…

    Submitted 21 March, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: This paper is accepted to the International Conference on Cyber-Physical Systems

  10. Distributionally Robust Chance-Constrained Flexibility Planning for Integrated Energy System

    Authors: Sen Zhan, Peng Hou, Guangya Yang

    Abstract: Inflexible combined heat and power (CHP) plants and uncertain wind power production result in excess power in distribution networks, which leads to inverse power flow, challenging grid operations. Power-to-X facilities such as electrolysers and electric boilers can offer extra flexibility to the integrated energy system. In this regard, we aim to jointly determine the optimal Power-to-X facility si…

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: MSc thesis at DTU, submitted to IJEPES

  11. arXiv:2105.01750  [pdf, other]

    eess.SY

    Optimal Real-time Coordination of Distributed Energy Resources in Low-voltage Grids

    Authors: Sen Zhan, Johan Morren, Wouter van den Akker, Anne van der Molen, Han Slootweg

    Abstract: This study proposes a real-time distributed energy resource (DER) coordination model that can exploit flexibility from the DERs to solve voltage and overloading issues using both active and reactive power. The model considers time-coupling devices including electric vehicles and heat pumps by deviating as little as possible from their original schedules while prioritizing DERs with the most urgent…

    Submitted 6 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Submitted for IEEE PES ISGT Europe 2021

  12. arXiv:2102.02059  [pdf]

    eess.SY

    Technoeconomic Supplement of P2G Clusters with Hydrogen Pipeline for Coordinated Renewable Energy and HVDC Systems

    Authors: Jiarong Li, Jin Lin, Yonghua Song, Jinyu Xiao, Feng Liu, Yuxuan Zhao, Sen Zhan

    Abstract: Given the downward trend in prices of renewable energy generators and the upward trend in hydrogen demand, this paper studies the technoeconomic supplement of P2G clusters with hydrogen pipeline for HVDC to jointly consume renewable energy. First, the planning and operation constraints of large-capacity P2G clusters are established. On this basis, the multistage coordinated planning model of renewab…

    Submitted 1 February, 2021; originally announced February 2021.

  13. arXiv:2001.00191  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Ensemble emotion recognizing with multiple modal physiological signals

    Authors: Jing Zhang, Yong Zhang, Suhua Zhan, Cheng Cheng

    Abstract: Physiological signals that provide an objective representation of human affective states are attracting increasing attention in the emotion recognition field. However, a single signal can hardly describe emotion completely and accurately. Multiple physiological signals fusing models, building the uniform classification model by means of consistent and complementary information fro…

    Submitted 1 January, 2020; originally announced January 2020.

    Comments: Under review for Multimedia Tools and Applications