计算机集成制造系统

• • 上一篇    下一篇

基于改进的Cascade RCNN铸管字符检测算法

王宇1,2,3,徐福丽4+,王怀震5,崔勇6,姜岩4,陶晔6,王译笙1,2,3,张琦4   

  1. 1.中国科学院网络化控制系统重点实验室
    2.中国科学院沈阳自动化研究所
    3.中国科学院机器人与智能制造创新研究院
    4.沈阳工业大学软件学院
    5.上海波士内智能科技有限公司
    6.沈阳中科博微科技股份有限公司

Improved cascade rcnn algorithm for character detection of cast pipe

WANG Yu1,2,3,XU Fuli4+,WANG Huaizhen5,CUI Yong6,JIANG Yan4,TAO Ye6,WANG Yisheng1,2,3,ZHANG Qi4   

  1. 1.Key Laboratory of Networked Control Systems,Chinese Academy of Sciences
    2.Shenyang Institute of Automation,Chinese Academy of Sciences
    3.Institutes for Robotics and Intelligent Manufacturing,Chinese Academy of Sciences
    4.School of Software,Shenyang University of Technology
    5.Shanghai Ballsnow Intelligent Technology Co.,Ltd
    6.Shenyang Zhongke Microcyber Technology Co.,Ltd

摘要: 由于工业现场采集的铸管字符图像存在背景模糊、字符区域占比小、刻字位置不固定、油漆遮挡等问题,导致现有模型的检测精度难以满足工业现场的需求。针对上述问题,提出改进的Cascade RCNN铸管字符检测算法。首先对特征金字塔进行改进,提出融合小目标增强的特征金字塔(Small Target Enhancement Feature Pyramid Networks,STE-FPN),利用多尺度特征融合的特征增强能力丰富铸管小目标字符的特征信息。其次引入自矫正/池化的ResNeSt(Self-Calibrated/Pooling ResNeSt,SCP-ResNeSt)作为特征提取网络,利用自矫正卷积和池化操作以提升背景复杂的铸管字符特征提取效率。最后对级联结构进行改进,引进Mask分支结构,可以自适应地检测字符区域并去除干扰区域,优化了检测结果。将改进后的算法在铸管数据集上进行测试,其平均检测精度mAP为99.1%,比原Cascade RCNN算法提高了2.3%。得到的精度表明改进后的性能优于原算法。

关键词: 铸管字符检测, 背景模糊, Cascade RCNN, ResNeSt

Abstract: Due to the issues of blurred background,small character areas,inconsistent engraving positions,and paint occlusion in the images of cast pipe characters collected from industrial sites,the existing models struggle to meet the detection accuracy requirements of industrial environments.To address these issues,an improved Cascade RCNN algorithm for cast pipe character detection is proposed.Firstly,improvements are made to the feature pyramid by introducing the Small Target Enhancement Feature Pyramid Networks (STE-FPN),which utilize the feature enhancement capability of multi-scale feature fusion to enrich the feature information of small target characters on cast pipes.Secondly,the Self-Calibrated/Pooling ResNeSt (SCP-ResNeSt) is introduced as the feature extraction network,utilizing self-calibrated convolutions and pooling operations to enhance the efficiency of extracting complex background features of cast pipe characters.Lastly,improvements are made to the cascade structure by introducing a Mask branch structure,which adaptively detects character areas and removes interference regions,thus optimizing the detection results.The improved algorithm was tested on the cast pipe dataset,and Mean Average Precision (mAP) is 99.1%,which increased the precision by 2.3% compared to the Cascade RCNN.The precision shows that the improved performance is superior to the original algorithm.

Key words: character detection of cast pipes, blurred background, Cascade RCNN, ResNeSt

中图分类号: