Computer Integrated Manufacturing Systems, 2016, Vol. 22, Issue 2: 330-342. DOI: 10.13196/j.cims.2016.02.006

• Product Innovation and Development Technology •

Heuristic parallelized algorithm for mining single firing sequences

ZHU Rui1, LI Tong1,2+, MO Qi1, DAI Fei1,2, GAO Tilei1, HE Yun1, SUN Xue1

  1. School of Software, Yunnan University
  2. Yunnan Provincial Key Laboratory of Software Engineering, Yunnan University
  • Online: 2016-02-29 Published: 2016-02-29
  • Supported by:
    Project supported by the National Natural Science Foundation, China (No. 61262024, 61262025, 61462095, 61462091, 61379032), the Natural Science Youth Foundation of Yunnan Province, China (No. 2014FD006), the Key Science Research Project of the Yunnan Provincial Department of Education, China (No. 2013Z057, 2015Z018), the Open Fund of Yunnan Provincial Key Lab of Software Engineering, China (No. 2012SE401), the General Project of the Yunnan Provincial Department of Science and Technology, China (No. 2012FB119), the Graduate Research Foundation of Yunnan University, China (No. ynuy201425), and the Scholarship Award for Excellent Doctoral Student of Yunnan Province, China (No. ynu201416).

Abstract: Mature mining algorithms cannot be applied to a single firing sequence because the case attributes they require are missing. To address this problem, the sequence is studied from two perspectives, the model level and the instance level. From the model perspective, the existence of loops in the trace is proved in order to guarantee that the mining foundation is correct, and a concurrent block set is constructed to resolve the confusion that concurrent activities cause during case separation. From the trace perspective, the heuristic approach is adapted to case separation, a heuristic measure of concurrency is proposed to reduce the impact of noise on the mining of concurrent relations, and cases are separated by building a dependency table that includes concurrent relations. Combining these elements, a framework is proposed that separates cases in parallel over the activity set and selects the best cases according to their fitness. Extensive experiments on real-world data sets demonstrate the effectiveness and correctness of the method for mining single firing sequences.

Key words: process mining, single firing sequence, heuristic approach, case separation, Petri nets

CLC number: