Showing 1–7 of 7 results for author: Taniguchi, T

Searching in archive stat.
  1. arXiv:2007.14535

    cs.LG cs.AI eess.SY stat.ML

    Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction

    Authors: Masashi Okada, Tadahiro Taniguchi

    Abstract: In the present paper, we propose a decoder-free extension of Dreamer, a leading model-based reinforcement learning (MBRL) method from pixels. Dreamer is a sample- and cost-efficient solution to robot learning, as it is used to train latent state-space models based on a variational autoencoder and to conduct policy optimization by latent trajectory imagination. However, this autoencoding based appr…

    Submitted 11 March, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted to ICRA2021. Camera ready version

  2. arXiv:2003.00370

    cs.LG cs.NE cs.RO stat.ML

    PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference

    Authors: Masashi Okada, Norio Kosaka, Tadahiro Taniguchi

    Abstract: In the present paper, we propose an extension of the Deep Planning Network (PlaNet), also referred to as PlaNet of the Bayesians (PlaNet-Bayes). There has been a growing demand in model predictive control (MPC) in partially observable environments in which complete information is unavailable because of, for example, lack of expensive sensors. PlaNet is a promising solution to realize such latent M…

    Submitted 29 February, 2020; originally announced March 2020.

  3. arXiv:2001.11628

    cs.LG cs.AI cs.RO stat.ML

    Domain-Adversarial and Conditional State Space Model for Imitation Learning

    Authors: Ryo Okumura, Masashi Okada, Tadahiro Taniguchi

    Abstract: State representation learning (SRL) in partially observable Markov decision processes has been studied to learn abstract features of data useful for robot control tasks. For SRL, acquiring domain-agnostic states is essential for achieving efficient imitation learning. Without these states, imitation learning is hampered by domain-dependent information useless for control. However, existing methods…

    Submitted 4 June, 2021; v1 submitted 30 January, 2020; originally announced January 2020.

    Comments: Published at IROS 2020

  4. arXiv:1907.04202

    cs.LG eess.SY stat.ML

    Variational Inference MPC for Bayesian Model-based Reinforcement Learning

    Authors: Masashi Okada, Tadahiro Taniguchi

    Abstract: In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy to enhance learning performance, making MBRLs competitive to cutting-edge model free methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which employs Bayesian inference to dynami…

    Submitted 6 October, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: Accepted to CoRL2019. Camera-ready ver

  5. Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model

    Authors: Akira Kinose, Tadahiro Taniguchi

    Abstract: Integration of reinforcement learning and imitation learning is an important problem that has been studied for a long time in the field of intelligent robotics. Reinforcement learning optimizes policies to maximize the cumulative reward, whereas imitation learning attempts to extract general knowledge about the trajectories demonstrated by experts, i.e., demonstrators. Because each of them has the…

    Submitted 16 October, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Submitted to Advanced Robotics

    Journal ref: Advanced Robotics, 2020, 34:16, 1055-1067

  6. arXiv:1510.00331

    cs.RO cs.AI stat.ML

    Multimodal Hierarchical Dirichlet Process-based Active Perception

    Authors: Tadahiro Taniguchi, Toshiaki Takano, Ryo Yoshino

    Abstract: In this paper, we propose an active perception method for recognizing object categories based on the multimodal hierarchical Dirichlet process (MHDP). The MHDP enables a robot to form object categories using multimodal information, e.g., visual, auditory, and haptic information, which can be observed by performing actions on an object. However, performing many actions on a target object requires a…

    Submitted 14 January, 2016; v1 submitted 1 October, 2015; originally announced October 2015.

    Comments: submitted

    Journal ref: Front. Neurorobot. 12:22. 2018

  7. arXiv:1506.06646

    cs.AI cs.CL cs.LG stat.ML

    Nonparametric Bayesian Double Articulation Analyzer for Direct Language Acquisition from Continuous Speech Signals

    Authors: Tadahiro Taniguchi, Ryo Nakashima, Shogo Nagasaka

    Abstract: Human infants can discover words directly from unsegmented speech signals without any explicitly labeled data. In this paper, we develop a novel machine learning method called nonparametric Bayesian double articulation analyzer (NPB-DAA) that can directly acquire language and acoustic models from observed continuous speech signals. For this purpose, we propose an integrative generative model that…

    Submitted 9 March, 2016; v1 submitted 22 June, 2015; originally announced June 2015.

    Comments: 15 pages, 7 figures, Draft submitted to IEEE Transactions on Autonomous Mental Development (TAMD)

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, vol. 8, no. 3, pp. 171-185, 2016