Computer Integrated Manufacturing System, 2024, Vol. 30, Issue (8): 2681-2687. DOI: 10.13196/j.cims.2023.BPM08


Machine learning workflow recommendation for data analysis in industrial Internet

WEN Yiping1+, TIAN Muyang1, TAN Zheng2, KANG Guosheng1, LIU Jianxun1

  1. Hunan Provincial Key Laboratory for Service Computing and Novel Software Technology, Hunan University of Science and Technology
  2. China Railway Construction Heavy Industry Corporation Limited
  • Online: 2024-08-31 Published: 2024-09-03
  • Supported by:
    Project supported by the National Key R&D Program, China (No. 2020YFB1707600), the National Natural Science Foundation, China (No. 62177014), and the Educational Department of Hunan Province, China (No. 20B222).

面向工业互联网数据分析的机器学习工作流推荐方法

文一凭1+,田沐阳1,谭铮2,康国胜1,刘建勋1   

  1. 湖南科技大学服务计算与软件服务新技术湖南省重点实验室
  2. 中国铁建重工集团股份有限公司
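  • About the authors:
    +WEN Yiping (文一凭, born 1981), male, from Qiyang, Hunan; professor, Ph.D., CCF member. Research interests: workflow systems, data mining, cloud computing and distributed processing. Corresponding author. E-mail: ypwen_0@qq.com

    TIAN Muyang (田沐阳, born 1999), male, from Zhuzhou, Hunan; master's student. Research interests: workflow technology and artificial intelligence.

    TAN Zheng (谭铮, born 1997), male, from Loudi, Hunan; master's degree. Research interests: intelligent manufacturing and workflow technology.

    KANG Guosheng (康国胜, born 1985), male, from Chenzhou, Hunan; Ph.D., lecturer. Research interests: service computing, workflow technology, BPM, etc.

    LIU Jianxun (刘建勋, born 1970), male, from Hengyang, Hunan; professor, Ph.D. Research interests: service computing, workflow technology, BPM, etc.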
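  • Funding (translated from the Chinese):
    National Key R&D Program of China (2020YFB1707600); National Natural Science Foundation of China (62177014); Education Department of Hunan Province (20B222).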

Abstract: Industrial big data is multi-modal and strongly associated, which poses many challenges for data analysis. Carrying out an effective data analysis process that meets the requirements of an industrial application is a complex, time-consuming and labor-intensive task. To address this, a machine learning workflow recommendation method for data analysis in the industrial Internet is proposed. The method starts from existing solutions and uses their datasets and machine learning workflows as the basis for recommendation. Using the Doc2vec model and the maximum mean discrepancy (MMD) method, it calculates the similarity between each existing solution and the current data analysis requirement in terms of their text descriptions and data distribution features, and then selects and recommends suitable machine learning workflows from the existing solutions. Simulation experiments demonstrate the effectiveness of the proposed method.
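
As a rough illustration of the two similarity measures named in the abstract, the sketch below computes a cosine similarity between two description vectors (assumed to come from a Doc2vec model, which is not trained here) and a biased estimate of the squared maximum mean discrepancy between two datasets. It reads the abstract's "maximum average difference" as maximum mean discrepancy (MMD). The RBF kernel, the `gamma` value, the weight `alpha`, the `exp(-MMD)` mapping and all function names are assumptions made for illustration; the abstract does not say how the paper combines the two similarities.

```python
import numpy as np


def rbf_kernel(a, b, gamma):
    """RBF kernel matrix between the rows of a and the rows of b."""
    sq_dist = (a ** 2).sum(axis=1)[:, None] + (b ** 2).sum(axis=1)[None, :] - 2.0 * a @ b.T
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))


def mmd2(x, y, gamma=1.0):
    """Biased estimate of the squared MMD between samples x and y (rows are records)."""
    return (rbf_kernel(x, x, gamma).mean()
            + rbf_kernel(y, y, gamma).mean()
            - 2.0 * rbf_kernel(x, y, gamma).mean())


def cosine(u, v):
    """Cosine similarity between two description vectors (e.g. Doc2vec outputs)."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


def combined_score(text_sim, mmd_value, alpha=0.5):
    """Hypothetical combination: weighted sum of text similarity and exp(-MMD^2)."""
    return alpha * text_sim + (1.0 - alpha) * float(np.exp(-mmd_value))


def rank_solutions(req_vec, req_data, solutions, alpha=0.5, gamma=1.0):
    """Sort (name, description_vector, dataset) tuples by descending combined score."""
    scored = [
        (name, combined_score(cosine(req_vec, vec), mmd2(req_data, data, gamma), alpha))
        for name, vec, data in solutions
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

A candidate whose description vector and data distribution are both close to the requirement scores near 1 under this hypothetical weighting; the workflow attached to the top-ranked solution would then be the one recommended.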

Key words: industrial Internet, machine learning workflow, recommendation, data analysis

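Abstract (translated from the Chinese): Industrial big data is multi-modal and strongly associated, which brings new challenges to data analysis and its applications. Carrying out an effective data analysis process that fits the characteristics of an industrial application's requirements is usually a very complex, time-consuming and labor-intensive task. To address this problem, a machine learning workflow recommendation method for industrial Internet data analysis is proposed. The method takes existing solutions as its starting point and uses the datasets and machine learning workflows they employ as references for recommendation. Based on the Doc2vec model and the maximum mean discrepancy method, it computes the similarity of text descriptions and the similarity of data distribution features, so that suitable machine learning workflows from existing solutions can be recommended according to the requirements of the current data analysis task. Simulation experiments illustrate the effectiveness of the method.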

Key words (translated from the Chinese): industrial Internet, machine learning workflow, recommendation, data analysis
