-
Modular Deep Reinforcement Learning with Temporal Logic Specifications
Abstract: We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure. We represent this temporal structure by a finite-state machine and construct an on-the-fly synchronised product with the MDP and the finite machine. The temp… ▽ More
Submitted 22 November, 2019; v1 submitted 23 September, 2019; originally announced September 2019.
Comments: arXiv admin note: text overlap with arXiv:1902.00778
-
arXiv:1812.08017 [pdf, ps, other]
AME Blockchain: An Architecture Design for Closed-Loop Fluid Economy Token System
Abstract: In this white paper, we propose a blockchain-based system, named AME, which is a decentralized infrastructure and application platform with enhanced security and self-management properties. The AME blockchain technology aims to increase the transaction throughput by adopting various optimizations in network transport and storage layers, and to enhance smart contracts with AI algorithm support. We… ▽ More
Submitted 18 December, 2018; originally announced December 2018.
Comments: arXiv admin note: text overlap with arXiv:1805.02707, arXiv:1802.09651, arXiv:1412.7584, arXiv:1809.00554, arXiv:1405.4951 by other authors