Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

In Frontiers in neuroinformatics

Aiming at the poor robustness and adaptability of traditional control methods for different situations, the deep deterministic policy gradient (DDPG) algorithm is improved by designing a hybrid function that includes different rewards superimposed on each other. In addition, the experience replay mechanism of DDPG is also improved by combining priority sampling and uniform sampling to accelerate the DDPG's convergence. Finally, it is verified in the simulation environment that the improved DDPG algorithm can achieve accurate control of the robot arm motion. The experimental results show that the improved DDPG algorithm can converge in a shorter time, and the average success rate in the robotic arm end-reaching task is as high as 91.27%. Compared with the original DDPG algorithm, it has more robust environmental adaptability.

Dong Ruyi, Du Junjie, Liu Yanan, Heidari Ali Asghar, Chen Huiling

2023

artificial intelligence, deep deterministic policy gradient algorithm, experience replay mechanism, intelligent control, machine learning, reward function, robotic arm