Object manipulation system based on image-based reinforcement learning

Sunin Kim, Hyun Jun Jo, Jae Bok Song

Research output: Contribution to journalArticlepeer-review

Abstract

Advances in reinforcement learning algorithms allow robots to learn complex tasks such as object manipulation. However, most of these tasks have been implemented only in simulations. In addition, it is difficult to apply reinforcement learning in the real world because of the difficulty in obtaining the state details for the learning process, such as the position of an object, and collecting large amount of data. Moreover, existing reinforcement learning algorithms are designed to learn a single task, so there is a limit to learning multiple tasks. To address these problems, a novel system is proposed in this study for applications to the real world after learning multiple tasks in the simulation. First, a generative model that converts real-world images into simulation images is proposed, so that simulation-to-real-world transfer wherein the learning results from simulation can be applied directly to the real-world scenarios is possible. Additionally, to learn multiple tasks using images, a reinforcement learning algorithm combining variational auto-encoder and asymmetric actor-critic is developed. To verify this system, experiments are conducted in which the algorithms learned in the simulation are applied to the real world to achieve a success rate of 83.8%; this shows that the proposed system can perform multiple manipulation tasks successfully.

Original languageEnglish
JournalIntelligent Service Robotics
DOIs
Publication statusAccepted/In press - 2022

Keywords

  • Machine learning
  • Object manipulation
  • Reinforcement learning
  • Sim-to-real

ASJC Scopus subject areas

  • Computational Mechanics
  • Engineering (miscellaneous)
  • Mechanical Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Object manipulation system based on image-based reinforcement learning'. Together they form a unique fingerprint.

Cite this