Computer Integrated Manufacturing System ›› 2024, Vol. 30 ›› Issue (12): 4282-4291.DOI: 10.13196/j.cims.2023.0369
Previous Articles Next Articles
LI Yong,ZHANG Chaoxing+,CHAI Liaoning
Online:
Published:
Supported by:
李勇,张朝兴+,柴燎宁
作者简介:
基金资助:
Abstract: To improve the obstacle avoidance trajectory planning ability of mobile robotic arm in narrow channel and obstacle constraint situations,by combining Artificial Potential Field method (APF) and Deep Deterministic Policy Gradient algorithm (DDPG),an improved algorithm named APF-DDPG was proposed.The APF planning was designed for the robotic arm to get the approximate pose,and the research problem was represented as a Markov decision process.The state space,action space and reward and punishment functions were designed,and the planning process was analyzed and processed in phases.A mechanism for guiding was designed to transition the various control phases,which the obstacle avoidance phase of the training was dominated by DDPG,and the approximate pose dominated the goal planning phase to guide the DDPG for the training.Thus the strategy model for planning was obtained from the training.Finally,simulation experiments of fixed and random state scenarios were established and designed to verify the effectiveness of the proposed algorithm.The experimental results showed that APF-DDPG algorithm could be trained with higher convergence efficiency to obtain a policy model with more efficient control performance by comparing with the traditional DDPG algorithm.
Key words: mobile robotic arm, obstacle avoidance trajectory planning, artificial potential field, deep deterministic policy gradient, guided training
摘要: 为了提高移动机械臂在狭窄通道和障碍物约束情况的避障轨迹规划能力,提出一种人工势场法(APF)和深度确定性策略梯度算法(DDPG)结合的改进算法(APF-DDPG)。首先,对机械臂设计了APF规划得到近似姿态,再将研究问题表示为马尔科夫决策过程,设计了状态空间、动作空间和奖惩函数,对规划过程进行阶段性分析处理,设计了一种引导机制来过渡各控制阶段,即避障阶段由DDPG主导训练,目标规划阶段由近似姿态引导DDPG训练,最终获得用于规划的策略模型。最后,建立并设计了固定和随机状态场景的仿真实验,验证了所提算法的有效性。实验结果表明,相较于传统DDPG算法,APF-DDPG算法能够以更高收敛效率训练得到具有更高效控制性能的策略模型。
关键词: 移动机械臂, 避障轨迹规划, 人工势场法, 深度确定性策略梯度, 引导训练
CLC Number:
TP241
TP18
LI Yong, ZHANG Chaoxing, CHAI Liaoning. Collaborative obstacle avoidance trajectory planning for mobile robotic arms based on artificial potential field DDPG algorithm[J]. Computer Integrated Manufacturing System, 2024, 30(12): 4282-4291.
李勇, 张朝兴, 柴燎宁. 基于人工势场DDPG算法的移动机械臂协同避障轨迹规划[J]. 计算机集成制造系统, 2024, 30(12): 4282-4291.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.cims-journal.cn/EN/10.13196/j.cims.2023.0369
http://www.cims-journal.cn/EN/Y2024/V30/I12/4282