
Showing 1–13 of 13 results for author: Ushio, T

Searching in archive cs.
  1. arXiv:2204.04383

    eess.SY cs.FL

    Learning-based Bounded Synthesis for Semi-MDPs with LTL Specifications

    Authors: Ryohei Oura, Toshimitsu Ushio

    Abstract: This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic $K$-co-Büchi automaton (d$K$cBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Ba…

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 6 pages, 4 figures

  2. arXiv:2201.08504

    stat.ML cs.LG eess.SY

    Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems. On the other hand, in general, constraints may be imposed on optimal control problems. In this study, we consider the optimal control problems with constraints to complete temporal control tasks. We describe the constraints using signal temporal lo…

    Submitted 19 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: 16 pages, 20 figures, accepted for IEEE Access

  3. arXiv:2108.01317

    eess.SY cs.LG

    Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: We apply deep reinforcement learning (DRL) to the design of a networked controller with network delays to complete a temporal control task that is described by a signal temporal logic (STL) formula. STL is useful to deal with a specification with a bounded time interval for a dynamical system. In general, an agent needs not only the current system state but also the past behavior of the system to dete…

    Submitted 27 March, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: 8 pages, 7 figures, revised for submitting to a conference

  4. arXiv:2105.03081

    eess.SY cs.FL

    Bounded Synthesis and Reinforcement Learning of Supervisors for Stochastic Discrete Event Systems with LTL Specifications

    Authors: Ryohei Oura, Toshimitsu Ushio, Ami Sakakibara

    Abstract: In this paper, we consider supervisory control of stochastic discrete event systems (SDESs) under linear temporal logic specifications. Applying the bounded synthesis, we reduce the supervisor synthesis to a problem of satisfying a safety condition. First, we consider a synthesis problem of a directed controller using the safety condition. We assign a negative reward to the unsafe states and int…

    Submitted 9 April, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: 15 pages, 4 figures, 2 tables, submitted to a journal

  5. Stability analysis and control of decision-making of miners in blockchain

    Authors: Kosuke Toda, Naomi Kuze, Toshimitsu Ushio

    Abstract: To maintain blockchain-based services while ensuring their security, an important issue is how to decide a mining reward so that the number of miners participating in the mining increases. We propose a dynamical model of decision-making for miners using an evolutionary game approach and analyze the stability of equilibrium points of the proposed model. The proposed model is described by the 1st-or…

    Submitted 24 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: 8 pages, 4 figures, accepted to a journal

  6. arXiv:2101.05640

    cs.LG eess.SY stat.ML

    Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: Applications of reinforcement learning (RL) to stabilization problems of real systems are restricted since an agent needs many experiences to learn an optimal policy and may determine dangerous actions during its exploration. If we know a mathematical model of a real system, a simulator is useful because it predicts behaviors of the real system using the mathematical model with a given system pa…

    Submitted 19 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

  7. Game-theoretic approach to decision-making problem for blockchain mining

    Authors: Kosuke Toda, Naomi Kuze, Toshimitsu Ushio

    Abstract: An important decision-making problem for a miner in a blockchain network is whether to participate in the mining so as to earn a reward by creating a new block earlier than other miners. We formulate this decision-making problem as a noncooperative game, because the probability of creating a block depends not only on one's own available computational resources, but also those of other…

    Submitted 11 December, 2020; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: 7 pages, 6 figures; an extended version of a manuscript accepted to IEEE L-CSS

  8. arXiv:2003.12274

    eess.SY cs.FL

    On-Line Synthesis of Permissive Supervisors for Partially Observed Discrete Event Systems under scLTL Constraints

    Authors: Ami Sakakibara, Toshimitsu Ushio

    Abstract: We consider a supervisory control problem of a discrete event system (DES) under partial observation, where a control specification is given by a fragment of linear temporal logic. We design an on-line supervisor that dynamically computes its control action with the complete information of the product automaton of the DES and an acceptor for the specification. The concepts of controllability and o…

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: 7 pages, 2 figures; Accepted for the 21st IFAC World Congress. arXiv admin note: text overlap with arXiv:2003.11808

  9. On-Line Permissive Supervisory Control of Discrete Event Systems for scLTL Specifications

    Authors: Ami Sakakibara, Toshimitsu Ushio

    Abstract: We propose an on-line supervisory control scheme for discrete event systems (DESs), where a control specification is described by a fragment of linear temporal logic. On the product automaton of the DES and an acceptor for the specification, we define a ranking function that returns the minimum number of steps required to reach an accepting state from each state. In addition, we introduce a permis…

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures

    Journal ref: in IEEE Control Systems Letters, vol. 4, no. 3, pp. 530-535, July 2020

  10. arXiv:2001.04669

    eess.SY cs.AI cs.LG cs.LO

    Reinforcement Learning of Control Policy for Linear Temporal Logic Specifications Using Limit-Deterministic Generalized Büchi Automata

    Authors: Ryohei Oura, Ami Sakakibara, Toshimitsu Ushio

    Abstract: This letter proposes a novel reinforcement learning method for the synthesis of a control policy satisfying a control specification described by a linear temporal logic formula. We assume that the controlled system is modeled by a Markov decision process (MDP). We convert the specification to a limit-deterministic generalized Büchi automaton (LDGBA) with several accepting sets that accepts all inf…

    Submitted 26 March, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 7 pages, 6 figures; an extended version of a manuscript accepted to IEEE L-CSS

  11. arXiv:1908.10722

    cs.LG eess.SY stat.ML

    Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning

    Authors: Junya Ikemoto, Toshimitsu Ushio

    Abstract: In this paper, we propose a design of a model-free networked controller for a nonlinear plant whose mathematical model is unknown. In a networked control system, the controller and plant are located away from each other and exchange data over a network, which causes network delays that may fluctuate randomly due to network routing. So, in this paper, we assume that the current network delay is not…

    Submitted 29 August, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: 6 pages, 9 figures, Accepted for presentation in the IEEE Conference on Decision and Control (CDC) 2019

  12. arXiv:1701.06247

    cs.CL cs.AI cs.LG

    A Multichannel Convolutional Neural Network For Cross-language Dialog State Tracking

    Authors: Hongjie Shi, Takashi Ushio, Mitsuru Endo, Katsuyoshi Yamagami, Noriaki Horii

    Abstract: The fifth Dialog State Tracking Challenge (DSTC5) introduces a new cross-language dialog state tracking scenario, where the participants are asked to build their trackers based on the English training corpus, while evaluating them with the unlabeled Chinese corpus. Although the computer-generated translations for both English and Chinese corpora are provided in the dataset, these translations conta…

    Submitted 22 January, 2017; originally announced January 2017.

    Comments: Copyright 2016 IEEE. Published in the 2016 IEEE Workshop on Spoken Language Technology (SLT 2016)

  13. Game Theoretic Approach to the Stabilization of Heterogeneous Multiagent Systems Using Subsidy

    Authors: Takuya Morimoto, Takafumi Kanazawa, Toshimitsu Ushio

    Abstract: We consider a multiagent system consisting of selfish and heterogeneous agents. Its behavior is modeled by multipopulation replicator dynamics, where payoff functions of populations are different from each other. In general, there exist several equilibrium points in the replicator dynamics. In order to stabilize a desirable equilibrium point, we introduce a controller called a government which con…

    Submitted 24 December, 2013; originally announced December 2013.

    Comments: 6 pages, IEEE Conference on Decision and Control, 2013