Collaborative manufacturing scheduling based on deep reinforcement learning

doi:10.13196/j.cims.2024.0187

Computer Integrated Manufacturing System ›› 2025, Vol. 31 ›› Issue (8): 2857-2869.DOI: 10.13196/j.cims.2024.0187

Previous Articles Next Articles

Collaborative manufacturing scheduling based on deep reinforcement learning

TANG Liang,KUANG Lilin

College of Transportation Engineering,Dalian Maritime University

Online:2025-08-31 Published:2025-09-04
Supported by:
Project supported by the National Natural Science Foundation,China(No.72372015),the National Social Science Foundation,China(No.24GBL026).

基于深度强化学习算法的协同制造调度优化

唐亮,匡理霖

大连海事大学交通运输工程学院

作者简介:
唐亮(1980-),男,江苏宜兴人,教授,博士,研究方向:供应链调度及优化、智能制造等,E-mail:tangericliang@dlmu.edu.cn;

匡理霖(1998-),男,四川内江人,硕士研究生,研究方向:协同制造、深度强化学习,E-mail:kllddu88@dlmu.edu.cn。
基金资助:
国家自然科学基金资助项目(72372015);国家社科基金资助项目(24BGL026)。

Abstract

Abstract: To solve a collaborative manufacturing scheduling problem where the Dominant Manufacturer (DM) outsources several processes to Collaborative Manufacturers (CMs) with the objective of minimizing overall costs for different orders,a Deep Reinforcement Learning (DRL) framework was proposed,which integrated disjunctive graph analysis to tackle the intricate scheduling dynamics inherent in collaborative manufacturing networks.In this way,the Agent learns the action strategy based on the input order status.The scheduling problem was transformed into a sequential decision-making task by employing a two-dimensional action space derived from the disjunctive graph structure.Through setting the manufacturing state as the input of a deep neural network model,the collaborative scheduling problem was transformed into a Markov Decision Process (MDP) problem.Additionally,a cost-oriented reward function was formulated to guide the exploration process of the agent aiming to identify the optimal action.The experimental results showed that the deep reinforcement learning algorithm outperformed any single scheduling rule,and also performed more superiorly on average in terms of solution effectiveness when compared to genetic algorithms.

Key words: collaborative manufacturing, disjunctive graph, deep reinforcement learning, scheduling, Markov decision process

摘要： 针对主导制造商将多个流程外包给协同制造商的协同调度问题,考虑同类产品不同订单在协同制造网络中的分配与调度,以多种成本之和的最小化为目标,提出了一种结合析取图的深度强化学习算法框架。根据输入的订单状态学习动作策略,将析取图的调度过程转化为一个多阶段的序列决策过程;利用基于析取图的状态空间,将协同制造订单的状态视为多通道图像输入网络,并依据状态转移的特点设计了包含订单选择规则和协同制造商指派规则的二维动作空间。根据问题的目标构造了关于成本的奖励函数,以指导智能体与环境交互,获取每个决策步最佳策略。实验结果表明,深度强化学习算法优于单一调度规则,与遗传算法相比较,在平均求解效果上表现更为优越。

关键词: 协同制造, 析取图, 深度强化学习, 调度, 马尔可夫决策过程

CLC Number:

TP18

TANG Liang, KUANG Lilin. Collaborative manufacturing scheduling based on deep reinforcement learning[J]. Computer Integrated Manufacturing System, 2025, 31(8): 2857-2869.

唐亮, 匡理霖. 基于深度强化学习算法的协同制造调度优化[J]. 计算机集成制造系统, 2025, 31(8): 2857-2869.

[1]	AN Youjun, ZHANG Jun, DONG Yuanfa, GAO Kaizhou, PENG Wei, ZHOU Bin. Integrated optimization of batch scheduling and multi-level imperfect maintenance for parallel batch-processing machines based on learning evolutionary algorithm [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3277-3295.
[2]	LI Yiren, WANG Bailin, YUAN Shuaipeng, ZHANG Zhuolun, LI Tieke, WANG Yang. Hybrid particle swarm optimization for steelmaking-continuous casting scheduling with flexible continuous caster maintenance [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3296-3307.
[3]	ZHAO Cai, WU Lianghong, ZUO Cili, ZHANG Hongqiang, LI Zhijing. Energy-efficient scheduling method of distributed assembly hybrid flow shop [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3324-3337.
[4]	WANG Yufang, HUA Xiaolin, ZENG Yazhi, CHEN Fan, YAO Binbin. Dual-resource constraints lot streaming scheduling for aviation structural components [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3338-3353.
[5]	LUO Zhuorong, LI Zhantao, CHEN Qingxin, PENG Chengfeng. In and out scheduling algorithm for automatic sorting system considering finite buffer [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3354-3367.
[6]	HAN Yajuan, ZHANG Junkang, WU Tingying. Order acceptance and scheduling decisions considering resource constraints for C2M enterprises [J]. Computer Integrated Manufacturing System, 2025, 31(9): 3501-3512.
[7]	ZHOU Xu, MIAO Hui, YANG Jing, JIANG Wu, LIAO Xiaoyan, LI Yijun, LI Shaobo, LU Jialin. Edge computing resource scheduling overview:Historical perspective,architecture,modeling and method analysis [J]. Computer Integrated Manufacturing System, 2025, 31(8): 2695-2726.
[8]	XIONG Fuli, CHEN Siyuan, XIONG Ningxin, SHI Jiangbo. Distributed heterogeneous non-permutation flowshop scheduling based on two-stage hybrid iterative greedy algorithm [J]. Computer Integrated Manufacturing System, 2025, 31(8): 2870-2883.
[9]	WANG Jianhua, QIU Ronggen, WANG Heng. Scheduling problem of distributed hybrid flow shop based on MOHIG algorithm [J]. Computer Integrated Manufacturing System, 2025, 31(8): 2884-2893.
[10]	WU Haoze, LI Yanwu, XIE Hui. Improved proximal policy optimization algorithm for solving flexible job shop scheduling problem [J]. Computer Integrated Manufacturing System, 2025, 31(8): 2894-2904.
[11]	WU Xiuli, LI Yuxin. Proactive-reactive dynamic scheduling method for reentrant hybrid flow shop with batch processing machines [J]. Computer Integrated Manufacturing System, 2025, 31(7): 2466-2481.
[12]	GUO Leilei, YE Chunming, LIU Zijun, TANG Tianyu, ZHANG Shuman, YAN Jinhui. Distributed assembly permutation flowshop scheduling problem with renewable energy [J]. Computer Integrated Manufacturing System, 2025, 31(7): 2482-2498.
[13]	GU Wenbin, GUO Zhenyang, LIU Siqi, YUAN Minghai, PEI Fengque. Multi-objective discrete workshop energy saving scheduling based on improved MLEA algorithm [J]. Computer Integrated Manufacturing System, 2025, 31(7): 2499-2514.
[14]	SUN Lin, YU Chunxia. Supply-demand matching of capacity sharing platform considering scheduling constraints [J]. Computer Integrated Manufacturing System, 2025, 31(7): 2659-2678.
[15]	LIANG Peng, LIANG Yingxin, ZHANG Chaoyong. Integrated modeling and optimization for steelmaking-continuous casting scheduling with ladle matching [J]. Computer Integrated Manufacturing System, 2025, 31(6): 2028-2042.

Collaborative manufacturing scheduling based on deep reinforcement learning

基于深度强化学习算法的协同制造调度优化

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics