Computer Integrated Manufacturing Systems ›› 2019, Vol. 25 ›› Issue (4): 920-928. DOI: 10.13196/j.cims.2019.04.014


Applicability analysis of log-based process similarity computing method

LIN Zedong1, ZENG Qingtian1, DUAN Hua2+, LU Faming1, ZHOU Changhong1

  1. College of Computer Science and Engineering, Shandong University of Science and Technology
  2. College of Mathematics and Systems Science, Shandong University of Science and Technology
  • Online: 2019-04-30 Published: 2019-04-30
  • Supported by:
    Project supported by the National Natural Science Foundation, China (No. 71704096, 61602278, 61602279, 61472229, 31671588), the Science & Technology Development Fund of Shandong Province, China (No. 2014GGX101035, 2016ZDJS02A11), the Shandong Provincial Natural Science Foundation, China (No. BS2014DX013, ZR2015FM013, ZR2017MF027), the Open Fund of Oceanic Telemetry Engineering and Technology Research Center, State Oceanic Administration, China (No. 2018002), the Research Institute of Highway, Ministry of Transport, China (No. 2015-9024, 2016-9027), the Shandong Provincial Postdoctoral Innovation Foundation, China (No. 201603056), the SDUST Research Fund, China (No. 2015TDJH102), and the Humanities and Social Science Foundation of Ministry of Education, China (No. 16YJCZH012).

Abstract: To address the shortcomings of existing business process similarity measures, process similarity is computed directly from process logs, and two existing approaches are adapted to take logs as input. The first approach reconstructs a process model from the log and computes similarity from the model structure. To reflect the log's preference for particular execution paths or branches, the log traces are replayed on the model: the A* algorithm finds the optimal replay for each trace, the flow relations executed under that replay are identified, and these flow relations are weighted to produce a weighted BPMN model on which similarity is computed. The second approach needs no model reconstruction and computes similarity from task sequences. To support task-level similarity, a distance between tasks is defined; this yields a trace distance that incorporates task distance, and process similarity is then computed over trace multisets. Experiments on several real-world data sets analyze where the similarity results of the two approaches agree and where they differ, and the scope of applicability of each approach is summarized.

Key words: business process, model structure, log behavior, similarity computing
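
The abstract does not state the paper's definitions, so the following is only a rough Python sketch of the general idea behind the second approach, not the authors' method. Everything in it is an assumption made for illustration: `task_distance` uses word overlap between task labels, `trace_distance` is a length-normalized edit distance whose substitution cost is the task distance, and `log_similarity` averages nearest-trace distances over the two trace multisets.

```python
from collections import Counter


def task_distance(a: str, b: str) -> float:
    """Hypothetical task distance: 1 - Jaccard similarity of the label words."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa and not wb:
        return 0.0
    return 1.0 - len(wa & wb) / len(wa | wb)


def trace_distance(s, t) -> float:
    """Edit distance in which replacing one task by another costs task_distance.

    Insertions and deletions cost 1; the result is divided by the longer
    trace length so that it lies in [0, 1].
    """
    prev = [float(j) for j in range(len(t) + 1)]
    for i, a in enumerate(s, start=1):
        cur = [float(i)]
        for j, b in enumerate(t, start=1):
            cur.append(min(
                prev[j] + 1.0,                      # delete task a
                cur[j - 1] + 1.0,                   # insert task b
                prev[j - 1] + task_distance(a, b),  # substitute a by b
            ))
        prev = cur
    longest = max(len(s), len(t))
    return prev[-1] / longest if longest else 0.0


def _directed(src: Counter, dst: Counter) -> float:
    """Frequency-weighted mean distance from each trace in src to its nearest trace in dst."""
    total = sum(src.values())
    return sum(n * min(trace_distance(s, t) for t in dst) for s, n in src.items()) / total


def log_similarity(log_a, log_b) -> float:
    """Assumed aggregation: 1 minus the symmetric mean of nearest-trace distances."""
    ma, mb = Counter(map(tuple, log_a)), Counter(map(tuple, log_b))
    if not ma or not mb:
        return 1.0 if not ma and not mb else 0.0
    return 1.0 - (_directed(ma, mb) + _directed(mb, ma)) / 2.0
```

Storing each log as a `Counter` of traces keeps the frequency of every trace, which is what lets frequent behavior weigh more than rare behavior. A real implementation would replace the label-overlap distance with whatever task distance the paper actually defines.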

CLC Number: