Computer Integrated Manufacturing System ›› 2022, Vol. 28 ›› Issue (10): 3225-3238.DOI: 10.13196/j.cims.2022.10.018

Previous Articles     Next Articles

Automatic business process model deep generation based on ordered neurons long short term memory

ZHU Rui1,2,LYU Changlong1,LI Tong2,3+,HE Yahui1,LIU Hang1,ZHANG Cunming1,CHEN Yeting2,4   

  1. 1.School of Software,Yunnan University
    2.Key Laboratory in Software Engineering of Yunnan Province,Yunnan University
    3.School of Big Data,Yunnan Agricultural University
    4.School of Economics and Management,Yunnan Normal University
  • Online:2022-10-31 Published:2022-11-10
  • Supported by:
    Project supported by the National Natural Science Foundation,China(No.62002310),the Major Project of Science and Technology of Yunnan Province,China(No.202002AD080002),the Yunnan Provincial Natural Science Foundation,China(No.202101AT070004),the Yunnan Provincial Software Engineering Key Laboratory Open Fund,China(No.2020SE404),and the Yunnan Philosophy and Social Science Youth Foundation,China(No.QN2020024).

基于ON-LSTM的业务过程模型深度自动生成

朱锐1,2,吕昌龙1,李彤2,3+,何亚辉1,刘航1,张存明1,陈晔婷2,4   

  1. 1.云南大学软件学院
    2.云南省软件工程重点实验室
    3.云南农业大学大数据学院
    4.云南师范大学经济与管理学院
  • 基金资助:
    国家自然科学基金资助项目(62002310);云南省重大科技专项计划资助项目(202002AD080002);云南省自然科学基金基础研究面上资助项目(202101AT070004);云南省软件工程重点实验室开放基金资助项目(2020SE404);云南哲学社会科学青年资助项目(QN2020024)。

Abstract: To break the limitations brought by existing process mining algorithms that could not be used when logs were missing,a novel method for deep automatic generation of business process models from process text descriptions was proposed based on the existing deep learning and natural language processing technology base.The existing named entity method was improved,and the activity entity recognition model was constructed by Bidirectional Encoder Representation from Transformers (BERT),Bi-directional Long Short Term Memory (BiLSTM),Conditional Random Fields (CRF),and the business process-oriented activity entity recognition method was proposed.The language model was extended from sentence level to document level,and a recursive architecture Ordered Neurons LSTM (ON-LSTM) was proposed to unsupervised discover the activity embedded in the process description document The hierarchical tree was finally transformed into a business process model by using the principle of hierarchical depth of active entities.Experiments were conducted on 150 real System Applications and Products (SAP) product user guide texts collected and labeled manually as training data,and several group experiments were conducted on the basis of ON-LSTM using K-fold cross-validation idea,which verified the effectiveness of the proposed method.

Key words: deep learning, business process discovery, active entity, hierarchical structure, ordered neurons long short term memory

摘要: 为打破现有过程挖掘算法在日志缺失时无法使用带来的局限性,基于现有的深度学习、自然语言处理技术基础,提出一种新颖的从过程文本描述中深度自动生成业务过程模型的方法。对现有命名实体方法进行改进,通过BERT,BiLSTM,CRF构建活动实体识别模型,提出面向业务过程的活动实体识别方法;将语言模型从句子级别扩展到文档级别,提出一种通过递归体系结构有序神经网络(ON-LSTM)无监督地发现过程描述文档中所蕴含的活动实体间潜在的层次结构;通过活动实体的层次深度原则,将层次结构树转化为业务过程模型。通过对人工采集与标注的150个真实的SAP产品用户指南文本作为训练数据进行实验,并在ON-LSTM基础上采用K折交叉验证思想对数据进行多次分组实验,验证了所提方法的有效性。

关键词: 深度学习, 业务过程发现, 活动实体, 层次结构, 有序神经长短期记忆网络

CLC Number: