计算机集成制造系统 ›› 2023, Vol. 29 ›› Issue (11): 3669-3680.DOI: 10.13196/j.cims.2022.0435

• • 上一篇    下一篇

基于SARSA算法的机器人轴孔装配策略

李少东1,袁小钢1,牛捷2,3,4+   

  1. 1.广西大学广西电力装备智能控制与运维重点实验室
    2.智能带电作业技术及装备(机器人)湖南省重点实验室
    3.带电巡检与智能作业技术国网公司实验室
    4.国家电网有限公司
  • 出版日期:2023-11-30 发布日期:2023-12-04
  • 基金资助:
    广西研究生教育创新计划资助项目(YCSW2022014);广西自然科学基金—青年基金资助项目(2022JJB170009);国网湖南超高压输电公司2021年实验室开放性课题(2021KZD2002)。

Robotic peg-in-hole assembly strategy research based on SARSA algorithm

LI Shaodong1,YUAN Xiaogang1,NIU Jie2,3,4+   

  1. 1.Guangxi Key Laboratory of Intelligent Control and Maintenance of Power Equipment,Guangxi University,
    2.Hunan Provincial Key Laboratory of Intelligent Live Working Technology and Equipment (Robot)
    3.Live Inspection and Intelligent Operation Technology State Grid Corporation Laboratory
    4.State Grid Corporation of China
  • Online:2023-11-30 Published:2023-12-04
  • Supported by:
    Project supported by the Innovation Project of Guangxi Graduate Education,China(No.YCSW2022014),the Guangxi Natural Science Foundation,China(No.2022JJB170009),and the State Grid Hunan EHV Transmission Line Company Opening Program,China(No.2021KZD2002).

摘要: 为解决机器人装配任务中轴孔位姿不确定问题,提高装配操作的成功率、效率和泛化能力,提出一种基于SARSA算法的变参数导纳控制策略。首先,分析了轴孔装配机理,指导运动控制策略设计。其次,仿真分析了不同导纳参数的位置响应,获得控制器参数。此外,建立了以插孔深度和单次调整移动量为尺度的动作评估机制,解决装配过程奖励函数建模难题,并在动作价值更新过程引入资格迹函数提高算法学习效率。最后,在真实机器人上开展了3组实验,对比基于位置控制和导纳控制的实验结果验证了所提算法在装配成功率和效率上的提升,从不同初始位姿开始装配的实验结果验证了所提算法的泛化能力。结果表明,所提算法有望解决机器人装配任务中轴孔位姿不确定问题。

关键词: 强化学习, 机器人柔顺控制, 轴孔装配, 导纳控制器

Abstract: To solve the problem of uncertain pose (position and pose) of peg and hole for improving the success rate,efficiency,and generalization ability of robot assembly operation,a variable admittance control strategy based on SARSA algorithm was proposed.The mechanism of peg-in-hole assembly was analyzed to guide the design of motion control strategy.Then,the different admittance parameters were analyzed by a series of simulations to obtain the controller parameters.In addition,an action evaluation method combining the displacement in each step and insertion depth was established to solve the problem of complex reward establishment in the assembly process.Meanwhile,qualification trace function is introduced to enhance learning efficiency of SARSA algorithm.Three sets of assembly experiments were implemented on the real robot.The improvement of success rate and efficiency could be verified by position and admittance control experiments.The performance of generalization ability was also validated through experiments in different initial pose.The results indicated that the proposed algorithm could solve the problem of the uncertain pose of peg and hole in the assembly task.

Key words: reinforcement learning, robot compliance control, peg-in-hole assembly, admittance controller

中图分类号: