Skip to main content

Showing 1–16 of 16 results for author: Ushio, T

Searching in archive eess. Search in all archives.
.
  1. Convex Estimation of Sparse-Smooth Power Spectral Densities from Mixtures of Realizations with Application to Weather Radar

    Authors: Hiroki Kuroda, Daichi Kitahara, Eiichi Yoshikawa, Hiroshi Kikuchi, Tomoo Ushio

    Abstract: In this paper, we propose a convex optimization-based estimation of sparse and smooth power spectral densities (PSDs) of complex-valued random processes from mixtures of realizations. While the PSDs are related to the magnitude of the frequency components of the realizations, it has been a major challenge to exploit the smoothness of the PSDs, because penalizing the difference of the magnitude of… ▽ More

    Submitted 14 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Journal ref: IEEE Access, vol. 11, pp. 128859-128874, 2023

  2. arXiv:2204.04383  [pdf, other

    eess.SY cs.FL

    Learning-based Bounded Synthesis for Semi-MDPs with LTL Specifications

    Authors: Ryohei Oura, Toshimitsu Ushio

    Abstract: This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic $K$-co-Büchi automaton (d$K$cBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Ba… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 6apges, 4figures

  3. arXiv:2201.08504  [pdf, other

    stat.ML cs.LG eess.SY

    Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems. On the other hand, in general, constraints may be imposed on optimal control problems. In this study, we consider the optimal control problems with constraints to complete temporal control tasks. We describe the constraints using signal temporal lo… ▽ More

    Submitted 19 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: 16 pages, 20 figures, accepted for IEEE Access

  4. arXiv:2108.01317  [pdf, other

    eess.SY cs.LG

    Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: We apply deep reinforcement learning (DRL) to design of a networked controller with network delays to complete a temporal control task that is described by a signal temporal logic (STL) formula. STL is useful to deal with a specification with a bounded time interval for a dynamical system. In general, an agent needs not only the current system state but also the past behavior of the system to dete… ▽ More

    Submitted 27 March, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: 8 pages, 7 figures, revised for submitting to a conference

  5. Collaborative rover-copter path planning and exploration with temporal logic specifications based on Bayesian update under uncertain environments

    Authors: Kazumune Hashimoto, Natsuko Tsumagari, Toshimitsu Ushio

    Abstract: This paper investigates a collaborative rover-copter path planning and exploration with temporal logic specifications under uncertain environments. The objective of the rover is to complete a mission expressed by a syntactically co-safe linear temporal logic (scLTL) formula, while the objective of the copter is to actively explore the environment and reduce its uncertainties, aiming at assisting t… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: This paper is accepted in ACM Transactions on Cyber Physical Systems (ACM TCPS)

  6. arXiv:2105.03081  [pdf, other

    eess.SY cs.FL

    Bounded Synthesis and Reinforcement Learning of Supervisors for Stochastic Discrete Event Systems with LTL Specifications

    Authors: Ryohei Oura, Toshimitsu Ushio, Ami Sakakibara

    Abstract: In this paper, we consider supervisory control of stochastic discrete event systems (SDESs) under linear temporal logic specifications. Applying the bounded synthesis, we reduce the supervisor synthesis into a problem of satisfying a safety condition. First, we consider a synthesis problem of a directed controller using the safety condition. We assign a negative reward to the unsafe states and int… ▽ More

    Submitted 9 April, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: 15 pages, 4 figures, 2 tables, submitted to a journal

  7. arXiv:2101.05640  [pdf, other

    cs.LG eess.SY stat.ML

    Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: Applications of reinforcement learning (RL) to stabilization problems of real systems are restricted since an agent needs many experiences to learn an optimal policy and may determine dangerous actions during its exploration. If we know a mathematical model of a real system, a simulator is useful because it predicates behaviors of the real system using the mathematical model with a given system pa… ▽ More

    Submitted 19 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

  8. arXiv:2004.01879  [pdf, other

    eess.SY

    Learning-based Symbolic Abstractions for Nonlinear Control Systems

    Authors: Kazumune Hashimoto, Adnane Saoud, Masako Kishida, Toshimitsu Ushio, Dimos Dimarogonas

    Abstract: Symbolic models or abstractions are known to be powerful tools for the control design of cyber-physical systems (CPSs) with logic specifications. In this paper, we investigate a novel learning-based approach to the construction of symbolic models for nonlinear control systems. In particular, the symbolic model is constructed based on learning the un-modeled part of the dynamics from training data… ▽ More

    Submitted 3 August, 2022; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in Automatica

  9. arXiv:2003.12274  [pdf, other

    eess.SY cs.FL

    On-Line Synthesis of Permissive Supervisors for Partially Observed Discrete Event Systems under scLTL Constraints

    Authors: Ami Sakakibara, Toshimitsu Ushio

    Abstract: We consider a supervisory control problem of a discrete event system (DES) under partial observation, where a control specification is given by a fragment of linear temporal logic. We design an on-line supervisor that dynamically computes its control action with the complete information of the product automaton of the DES and an acceptor for the specification. The concepts of controllability and o… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: 7 pages, 2 figures; Accepted for the 21st IFAC World Congress. arXiv admin note: text overlap with arXiv:2003.11808

  10. On-Line Permissive Supervisory Control of Discrete Event Systems for scLTL Specifications

    Authors: Ami Sakakibara, Toshimitsu Ushio

    Abstract: We propose an on-line supervisory control scheme for discrete event systems (DESs), where a control specification is described by a fragment of linear temporal logic. On the product automaton of the DES and an acceptor for the specification, we define a ranking function that returns the minimum number of steps required to reach an accepting state from each state. In addition, we introduce a permis… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures

    Journal ref: in IEEE Control Systems Letters, vol. 4, no. 3, pp. 530-535, July 2020

  11. arXiv:2001.04669  [pdf, other

    eess.SY cs.AI cs.LG cs.LO

    Reinforcement Learning of Control Policy for Linear Temporal Logic Specifications Using Limit-Deterministic Generalized Büchi Automata

    Authors: Ryohei Oura, Ami Sakakibara, Toshimitsu Ushio

    Abstract: This letter proposes a novel reinforcement learning method for the synthesis of a control policy satisfying a control specification described by a linear temporal logic formula. We assume that the controlled system is modeled by a Markov decision process (MDP). We convert the specification to a limit-deterministic generalized Büchi automaton (LDGBA) with several accepting sets that accepts all inf… ▽ More

    Submitted 26 March, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 7 pages, 6 figures; an extended version of a manuscript accepted to IEEE L-CSS

  12. arXiv:1912.02513  [pdf, other

    eess.SY

    Control of Timed Discrete Event Systems with Ticked Linear Temporal Logic Constraints

    Authors: Takuma Kinugawa, Kazumune Hashimoto, Toshimitsu Ushio

    Abstract: This paper presents a novel method of synthesizing a fragment of a timed discrete event system(TDES),introducing a novel linear temporal logic(LTL), called ticked LTL$_f$. The ticked LTL$_f$ is given as an extension to LTL$_f$, where the semantics is defined over a finite execution fragment. Differently from the standard LTL$_f$, the formula is defined as a variant of metric temporal logic formula… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  13. arXiv:1909.00178  [pdf, other

    eess.SY

    Learning self-triggered controllers with Gaussian processes

    Authors: Kazumune Hashimoto, Yuichi Yoshimura, Toshimitsu Ushio

    Abstract: This paper investigates the design of self-triggered controllers for networked control systems (NCSs), where the dynamics of the plant is \textit{unknown} apriori. To deal with the unknown transition dynamics, we employ the Gaussian process (GP) regression in order to learn the dynamics of the plant. To design the self-triggered controller, we formulate an optimal control problem, such that the op… ▽ More

    Submitted 8 March, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: appear in IEEE Transactions on Cybernetics

  14. arXiv:1908.10722  [pdf, other

    cs.LG eess.SY stat.ML

    Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: In this paper, we propose a design of a model-free networked controller for a nonlinear plant whose mathematical model is unknown. In a networked control system, the controller and plant are located away from each other and exchange data over a network, which causes network delays that may fluctuate randomly due to network routing. So, in this paper, we assume that the current network delay is not… ▽ More

    Submitted 29 August, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: 6 pages, 9 figures, Accepted for presentation in the IEEE Conference on Decision and Control (CDC) 2019

  15. arXiv:1907.07775  [pdf, other

    eess.SY stat.ML

    Model-free Control of Chaos with Continuous Deep Q-learning

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: The OGY method is one of control methods for a chaotic system. In the method, we have to calculate a stabilizing periodic orbit embedded in its chaotic attractor. Thus, we cannot use this method in the case where a precise mathematical model of the chaotic system cannot be identified. In this case, the delayed feedback control proposed by Pyragas is useful. However, even in the delayed feedback co… ▽ More

    Submitted 24 August, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: 7 pages, 8 figures, Submitted to Journal

  16. Output Feedback Controller Design with Symbolic Observers for Cyber-physical Systems

    Authors: Masashi Mizoguchi, Toshimitsu Ushio

    Abstract: In this paper, we design a symbolic output feedback controller of a cyber-physical system (CPS). The physical plant is modeled by an infinite transition system. We consider the situation that a finite abstracted system of the physical plant, called a c-abstracted system, is given. There exists an approximate alternating simulation relation from the c-abstracted system to the physical plant. A desi… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: In Proceedings V2CPS-16, arXiv:1612.04023

    ACM Class: B.5.1; I.2.8

    Journal ref: EPTCS 232, 2016, pp. 37-51