A deep reinforcement learning algorithm based on modified Twin delay DDPG method for robotic applications

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

This paper proposes a deep reinforcement learning algorithm for autonomous robotics, in which we modify twin delay deep deterministic policy gradient (TD3) to adapt for autonomous robots with higher degree freedom in movement. To provide a robot with free movement in the 2D space without collisions against some obstacles, such as wall, a robot is equipped with three cameras. The images captured by camera are used to train Convolutional Neural Networks (CNN) to understand environment with collisions or not-collisions. We added two additional parameters, observation' O', which are images obtained from cameras, and degrees of turns' deg' into the original TD3' s parameters composed of four values: [state's', reward 'r', action 'a' and next-state's' ']. To determine a next action with higher reward from the observation, two additional Neural Networks are constructed, being the first one determines an action from observation and the second one determines degree of turn from the observation and the action. The simulation results under three environments constructed by CoppeliaSim show a good performance of the proposed algorithm, reaching the target with higher rewards, even though the environments are unknown by robots.

Original languageEnglish
Title of host publication2021 21st International Conference on Control, Automation and Systems, ICCAS 2021
PublisherIEEE Computer Society
Pages743-748
Number of pages6
ISBN (Electronic)9788993215212
DOIs
StatePublished - 2021
Event21st International Conference on Control, Automation and Systems, ICCAS 2021 - Jeju, Korea, Republic of
Duration: 12 Oct 202115 Oct 2021

Publication series

NameInternational Conference on Control, Automation and Systems
Volume2021-October
ISSN (Print)1598-7833

Conference

Conference21st International Conference on Control, Automation and Systems, ICCAS 2021
Country/TerritoryKorea, Republic of
CityJeju
Period12/10/2115/10/21

Keywords

  • Actor-Critic
  • Deep Q-Learning
  • Deep Reinforcement Learning
  • Policy Gradient
  • Robot Vision

Fingerprint

Dive into the research topics of 'A deep reinforcement learning algorithm based on modified Twin delay DDPG method for robotic applications'. Together they form a unique fingerprint.

Cite this