Skip to main content

Showing 1–16 of 16 results for author: Ohmura, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17624  [pdf, ps, other

    cs.RO

    Imitation Learning for Active Neck Motion Enabling Robot Manipulation beyond the Field of View

    Authors: Koki Nakagawa, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Most prior research in deep imitation learning has predominantly utilized fixed cameras for image input, which constrains task performance to the predefined field of view. However, enabling a robot to actively maneuver its neck can significantly expand the scope of imitation learning to encompass a wider variety of tasks and expressive actions such as neck gestures. To facilitate imitation learnin… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 6 pages

  2. arXiv:2506.04668  [pdf, ps, other

    cs.CV cs.AI

    Feature-Based Lie Group Transformer for Real-World Applications

    Authors: Takayuki Komatsu, Yoshiyuki Ohmura, Kayato Nishitsunoi, Yasuo Kuniyoshi

    Abstract: The main goal of representation learning is to acquire meaningful representations from real-world sensory inputs without supervision. Representation learning explains some aspects of human development. Various neural network (NN) models have been proposed that acquire empirically good representations. However, the formulation of a good representation has not been established. We recently proposed… ▽ More

    Submitted 9 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

    Comments: 8 pages, the dataset used in this work is https://drive.google.com/file/d/1RaSWNN2GEyV3zQPeGya4Mr9DDhJ7OMz7/view?usp=sharing

  3. arXiv:2504.04490  [pdf, other

    cs.CV

    Learning Conditionally Independent Transformations using Normal Subgroups in Group Theory

    Authors: Kayato Nishitsunoi, Yoshiyuki Ohmura, Takayuki Komatsu, Yasuo Kuniyoshi

    Abstract: Humans develop certain cognitive abilities to recognize objects and their transformations without explicit supervision, highlighting the importance of unsupervised representation learning. A fundamental challenge in unsupervised representation learning is to separate different transformations in learned feature representations. Although algebraic approaches have been explored, a comprehensive theo… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 8 pages, 10 figures, conference paper

  4. arXiv:2502.18121  [pdf, ps, other

    cs.RO cs.CV

    Enhancing Reusability of Learned Skills for Robot Manipulation via Gaze Information and Motion Bottlenecks

    Authors: Ryo Takizawa, Izumi Karino, Koki Nakagawa, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Autonomous agents capable of diverse object manipulations should be able to acquire a wide range of manipulation skills with high reusability. Although advances in deep learning have made it increasingly feasible to replicate the dexterity of human teleoperation in robots, generalizing these acquired skills to previously unseen scenarios remains a significant challenge. In this study, we propose a… ▽ More

    Submitted 26 August, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  5. arXiv:2502.08098  [pdf, ps, other

    cs.LG cs.NE

    Unsupervised categorization of similarity measures

    Authors: Yoshiyuki Ohmura, Wataru Shimaya, Yasuo Kuniyoshi

    Abstract: In general, objects can be distinguished on the basis of their features, such as color or shape. In particular, it is assumed that similarity judgments about such features can be processed independently in different metric spaces. However, the unsupervised categorization mechanism of metric spaces corresponding to object features remains unknown. Here, we show that the artificial neural network sy… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.00239

  6. arXiv:2501.15071  [pdf, other

    cs.RO

    Gaze-Guided Task Decomposition for Imitation Learning in Robotic Manipulation

    Authors: Ryo Takizawa, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: In imitation learning for robotic manipulation, decomposing object manipulation tasks into sub-tasks enables the reuse of learned skills and the combination of learned behaviors to perform novel tasks, rather than simply replicating demonstrated motions. Human gaze is closely linked to hand movements during object manipulation. We hypothesize that an imitating agent's gaze control, fixating on spe… ▽ More

    Submitted 26 February, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

  7. arXiv:2401.07603  [pdf, other

    cs.RO cs.AI

    Multi-task real-robot data with gaze attention for dual-arm fine manipulation

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: In the field of robotic manipulation, deep imitation learning is recognized as a promising approach for acquiring manipulation skills. Additionally, learning from diverse robot datasets is considered a viable method to achieve versatility and adaptability. In such research, by learning various tasks, robots achieved generality across multiple objects. However, such multi-task robot datasets have m… ▽ More

    Submitted 19 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 10 pages, The dataset is available at https://sites.google.com/view/multi-task-fine

  8. arXiv:2310.03273  [pdf, other

    cs.CV cs.LG eess.IV

    Ablation Study to Clarify the Mechanism of Object Segmentation in Multi-Object Representation Learning

    Authors: Takayuki Komatsu, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Multi-object representation learning aims to represent complex real-world visual input using the composition of multiple objects. Representation learning methods have often used unsupervised learning to segment an input image into individual objects and encode these objects into each latent vector. However, it is not clear how previous methods have achieved the appropriate segmentation of individu… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  9. arXiv:2203.11210  [pdf, other

    cs.CV cs.AI

    Disentangling Patterns and Transformations from One Sequence of Images with Shape-invariant Lie Group Transformer

    Authors: T. Takada, W. Shimaya, Y. Ohmura, Y. Kuniyoshi

    Abstract: An effective way to model the complex real world is to view the world as a composition of basic components of objects and transformations. Although humans through development understand the compositionality of the real world, it is extremely difficult to equip robots with such a learning mechanism. In recent years, there has been significant research on autonomously learning representations of the… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 6 pages, 6 figures

    MSC Class: I.2.6

  10. Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulation

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Long-horizon dexterous robot manipulation of deformable objects, such as banana peeling, is a problematic task because of the difficulties in object modeling and a lack of knowledge about stable and dexterous manipulation skills. This paper presents a goal-conditioned dual-action (GC-DA) deep imitation learning (DIL) approach that can learn dexterous manipulation skills using human demonstration d… ▽ More

    Submitted 21 May, 2025; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 19 pages, published in Transactions on Robotics (T-RO)

  11. arXiv:2202.09574  [pdf, other

    cs.RO cs.AI

    Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Akihiko Nagakubo, Yasuo Kuniyoshi

    Abstract: Deep imitation learning is promising for robot manipulation because it only requires demonstration samples. In this study, deep imitation learning is applied to tasks that require force feedback. However, existing demonstration methods have deficiencies; bilateral teleoperation requires a complex control scheme and is expensive, and kinesthetic teaching suffers from visual distractions from human… ▽ More

    Submitted 26 February, 2024; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: 8 pages

    Journal ref: IEEE Robotics and Automation Letters 8.5 (2023): 2906-2913

  12. arXiv:2202.04877  [pdf, other

    cs.RO cs.AI cs.CV

    Memory-based gaze prediction in deep imitation learning for robot manipulation

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Deep imitation learning is a promising approach that does not require hard-coded control rules in autonomous robot manipulation. The current applications of deep imitation learning to robot manipulation have been limited to reactive control based on the states at the current time step. However, future robots will also be required to solve tasks utilizing their memory obtained by experience in comp… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 7 pages. Accepted in 2022 IEEE/RSJ International Conference on Robotics and Automation (ICRA)

  13. arXiv:2109.10501  [pdf, other

    cs.RO

    Third-party Evaluation of Robotic Hand Designs Using a Mechanical Glove

    Authors: Takayuki Kanai, Yoshiyuki Ohmura, Akihiko Nagakubo, Yasuo Kuniyoshi

    Abstract: A robotic hand design suitable for dexterity should be examined using functional tests. To achieve this, we designed a mechanical glove, which is a rigid wearable glove that enables us to develop the corresponding isomorphic robotic hand and evaluate its hardware properties. Subsequently, the effectiveness of multiple degrees-of-freedom (DOFs) was evaluated by human participants. Several fine moto… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 5 pages, 7 figures

    Journal ref: Journal of the Robotics Society of Japan, Vol.39, No.6, pp.557-560, 2021

  14. arXiv:2108.00385  [pdf, other

    cs.RO cs.AI

    Transformer-based deep imitation learning for dual-arm robot manipulation

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: Deep imitation learning is promising for solving dexterous manipulation tasks because it does not require an environment model and pre-programmed robot behavior. However, its application to dual-arm manipulation tasks remains challenging. In a dual-arm manipulation setup, the increased number of state dimensions caused by the additional robot manipulators causes distractions and results in poor pe… ▽ More

    Submitted 21 May, 2025; v1 submitted 1 August, 2021; originally announced August 2021.

    Comments: 8 pages. Accepted in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  15. Gaze-based dual resolution deep imitation learning for high-precision dexterous robot manipulation

    Authors: Heecheol Kim, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: A high-precision manipulation task, such as needle threading, is challenging. Physiological studies have proposed connecting low-resolution peripheral vision and fast movement to transport the hand into the vicinity of an object, and using high-resolution foveated vision to achieve the accurate homing of the hand to the object. The results of this study demonstrate that a deep imitation learning b… ▽ More

    Submitted 21 May, 2025; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: 8 pages. The supplementary video can be found at: https://www.youtube.com/watch?v=ytpChcFqD5g Published in IEEE Robotics and Automation Letters. Replaced to add video url in the manuscript

    Journal ref: IEEE Robotics and Automation Letters, Vol. 6, No. 2, 2021

  16. Identifying Critical States by the Action-Based Variance of Expected Return

    Authors: Izumi Karino, Yoshiyuki Ohmura, Yasuo Kuniyoshi

    Abstract: The balance of exploration and exploitation plays a crucial role in accelerating reinforcement learning (RL). To deploy an RL agent in human society, its explainability is also essential. However, basic RL approaches have difficulties in deciding when to choose exploitation as well as in extracting useful points for a brief explanation of its operation. One reason for the difficulties is that thes… ▽ More

    Submitted 8 November, 2020; v1 submitted 25 August, 2020; originally announced August 2020.

    Comments: 12 pages, 6 figures