计算机集成制造系统 ›› 2018, Vol. 24 ›› Issue (第8): 1996-2004.DOI: 10.13196/j.cims.2018.08.011

• 当期目次 • 上一篇    下一篇

基于增强学习的机械臂轨迹跟踪控制

刘卫朋1,邢关生2,陈海永1,孙鹤旭1   

  1. 1.河北工业大学控制科学与工程学院
    2.青岛科技大学自动化与电子工程学院
  • 出版日期:2018-08-31 发布日期:2018-08-31
  • 基金资助:
    河北省科技计划资助项目(17211804D);天津市教委科研计划资助项目(20140728);河北省自然科学基金资助项目(F2018202078);河北省首批青年拔尖人才支持计划资助项目(210003);天津市自然科学基金资助项目(16JCQNJC04200)。

Robotic trajectory tracking control method based on reinforcement learning

  • Online:2018-08-31 Published:2018-08-31
  • Supported by:
    Project supported by the Technology Program of Hebei Province,China(No.17211804D),the Tianjin Municipal Education Commission Research Program,China(No.20140728),the Natural Science Foundation of Hebei Province,China(No.F2018202078),the Young Talents Program in Hebei Province,China(No.210003),and the Natural Science Foundation of Tianjin Province,China(No.16JCQNJC04200).

摘要: 为了提高机器臂轨迹跟踪控制器的工作性能,提出基于增强学习的机械臂轨迹跟踪控制方法。介绍了增强学习的基本原理,提出基于SARSA算法的增强学习补偿控制策略。利用比例—微分(PD)控制器完成了基本的稳定任务后,再利用增强学习算法实现了对未知干扰因素的补偿,提升了对不同未知情况的适应能力。实验结果验证了自适应离散化增强学习方法在机械臂轨迹跟踪问题中的可行性和有效性,明显提高了控制器的学习速度。

关键词: 机器人, 增强学习, 轨迹跟踪, 比例&mdash, 微分控制器, 前馈神经网络

Abstract: To improve the working performance of robotic trajectory tracking controller,the robotic trajectory tracking control method based on reinforcement learning was proposed.The basic principle of reinforcement learning was introduced,and then the robot trajectory tracking control strategy based on SARSA was proposed.By using the reinforcement learning,the unknown disturbance factors were compensated and the adaptability to the unknown was improved after the PD control method was applied.The experimental results verified the feasibility and effectiveness of the reinforcement learning method in the trajectory tracking problem of robot arms,and the learning speed of the controller was enhanced.

Key words: robot, reinforcement learning, trajectory tracking, PD controller, feedforward neural network

中图分类号: