计算机集成制造系统 ›› 2022, Vol. 28 ›› Issue (6): 1603-1615.DOI: 10.13196/j.cims.2022.06.001

• •    下一篇

基于深度学习的命名实体识别研究

冀振燕1,孔德焱1,刘伟2,董为2,桑艳娟3   

  1. 1.北京交通大学软件学院
    2.中国科学院软件研究所
    3.中科蓝智(武汉)科技有限公司
  • 出版日期:2022-06-30 发布日期:2022-07-05
  • 基金资助:
    国家重点研发计划资助项目(2020YFB1711700);国家自然科学基金资助项目(51935002,52175493)。

Named entity recognition based on deep learning

  • Online:2022-06-30 Published:2022-07-05
  • Supported by:
    Project supported by the National Key Research and Development Program,China(No.2020YFB1711700),and the National Natural Science Foundation,China(No.51935002,52175493).

摘要: 针对先进制造业多模态异构数据并存导致知识抽取和表示困难等问题,命名实体识别方法成为当前的研究热点。综述了工业上常用的命名实体识别方法,首先介绍了传统的命名实体识别方法,然后阐述了已成为命名实体识别领域主流的基于深度学习方法,从分布式输入表示、上下文编码和标签解码器3个步骤分别对常用方法进行了分类和分析。鉴于基于深度学习方法的研究侧重于分布式输入表示和文本上下文编码模型的设计与改进,对分布式输入表示的各种方法进行了对比,指出其优缺点;对文本上下文编码模型从捕获长距离依赖、局部上下文信息、并行性、信息损失程度、可迁移性等方面进行了对比,指出各模型的特点。最后指出未来需要应对的挑战和研究方向。

关键词: 命名实体识别, 深度学习, 分布式输入表示, 上下文编码, 标签解码器

Abstract: To address the problems of knowledge extraction and representation difficulties caused by the coexistence of multimodal heterogeneous data in advanced manufacturing industry,the methods of Named Entity Recognition (NER) methods  become the research hot spot.The commonly used methods of NER in industry was summarized,which had introduced the traditional methods of NER firstly and then expounded the methods based on deep learning.The common methods were classified and analyzed from  three stages of distributed input representation,context encoder and tag decoder.The various methods of distributed input representation were compared,and their advantages and disadvantages were pointed out.The context encoder models from long-distance dependence capture,local context information,parallelism,information loss degree and transportability were also compared,and the characteristics of each model were given.Finally,the challenges to be addressed and research directions in the future were illustrated.

Key words: named entity recognition, deep learning, distributed input representation, context encoder, tag decoder

中图分类号: