A deep reinforcement learning algorithm based on modified Twin delay DDPG method for robotic applications

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

2 Citas (Scopus)

Resumen

This paper proposes a deep reinforcement learning algorithm for autonomous robotics, in which we modify twin delay deep deterministic policy gradient (TD3) to adapt for autonomous robots with higher degree freedom in movement. To provide a robot with free movement in the 2D space without collisions against some obstacles, such as wall, a robot is equipped with three cameras. The images captured by camera are used to train Convolutional Neural Networks (CNN) to understand environment with collisions or not-collisions. We added two additional parameters, observation' O', which are images obtained from cameras, and degrees of turns' deg' into the original TD3' s parameters composed of four values: [state's', reward 'r', action 'a' and next-state's' ']. To determine a next action with higher reward from the observation, two additional Neural Networks are constructed, being the first one determines an action from observation and the second one determines degree of turn from the observation and the action. The simulation results under three environments constructed by CoppeliaSim show a good performance of the proposed algorithm, reaching the target with higher rewards, even though the environments are unknown by robots.

Idioma originalInglés
Título de la publicación alojada2021 21st International Conference on Control, Automation and Systems, ICCAS 2021
EditorialIEEE Computer Society
Páginas743-748
Número de páginas6
ISBN (versión digital)9788993215212
DOI
EstadoPublicada - 2021
Evento21st International Conference on Control, Automation and Systems, ICCAS 2021 - Jeju, República de Corea
Duración: 12 oct. 202115 oct. 2021

Serie de la publicación

NombreInternational Conference on Control, Automation and Systems
Volumen2021-October
ISSN (versión impresa)1598-7833

Conferencia

Conferencia21st International Conference on Control, Automation and Systems, ICCAS 2021
País/TerritorioRepública de Corea
CiudadJeju
Período12/10/2115/10/21

Huella

Profundice en los temas de investigación de 'A deep reinforcement learning algorithm based on modified Twin delay DDPG method for robotic applications'. En conjunto forman una huella única.

Citar esto