车联网环境下基于强化学习的边缘服务器部署策略

doi:10.13196/j.cims.2022.10.011

计算机集成制造系统 ›› 2022, Vol. 28 ›› Issue (10): 3146-3155.DOI: 10.13196/j.cims.2022.10.011

车联网环境下基于强化学习的边缘服务器部署策略

严翰致¹,许小龙^1,4+,代飞²,齐连永³,窦万春⁴,李彤⁵

1.南京信息工程大学计算机与软件学院
2.西南林业大学大数据与智能工程学院
3.曲阜师范大学信息科学与工程学院
4.南京大学计算机软件新技术国家重点实验室
5.云南农业大学大数据学院

出版日期:2022-10-31 发布日期:2022-11-10
基金资助:
国家重点研发计划资助项目(2020YFB1707600);新疆生产建设兵团财政科技支撑计划资助项目(2020DB005)。

Edge server deployment strategy with reinforcement learning in Internet of vehicles

YAN Hanzhi¹,XU Xiaolong^1,4+,DAI Fei²,QI Lianyong³,DOU Wanchun⁴,LI Tong⁵

1.School of Computer and Software,Nanjing University of Information Science and Technology
2.College of Big Data and Intelligent Engineering,Southwest Forestry University
3.School of Information Science and Engineering,Qufu Normal University
4.State Key Laboratory for Novel Software Technology,Nanjing University
5.College of Big Data,Yunnan Agricultural University

Online:2022-10-31 Published:2022-11-10
Supported by:
Project supported by the National Key Research and Development Program,China(No.2020YFB1707600),and the Financial Science & Technology Supporting Plan of Xinjiang Production and Construction Corps,China(No.2020DB005).

摘要/Abstract

摘要： 鉴于现有的边缘服务器部署策略主要用于改善5G、无线城域网等场景下的服务性能,无法直接用于车联网服务部署,提出一种边云协同的5G车联网边缘计算系统模型,针对该系统模型设计了基于强化学习的边缘服务器部署策略,其以负载优化为核心目标,在保证低延迟和低能耗前提下实现边缘服务器间的负载均衡。根据路边单元位置信息用Canopy聚类获取初始的聚簇数,用模糊C均值聚类获取路边单元的初始划分,并输出路边单元归属优先级矩阵;通过强化学习获得路边单元归属的最优状态并计算聚簇中心作为边缘服务器部署位置。通过对比实验验证了该策略在低服务延迟和低能耗下,能够高度实现边缘服务器间的负载均衡,表明该策略具有优越性。

关键词: 边缘计算, 负载均衡, 模糊C均值, 强化学习

Abstract: The existing edge server placement methods are mainly used to improve service performance in scenarios such as 5G and wireless metropolitan area networks,but cannot be directly used for the deployment of Internet of Vehicles (IoV) services.Therefore,an edge computing system model with edge-cloud collaboration for 5G-IoV was proposed,and a deployment Strategy of edge servers based on Reinforcement Learning (SRL) was designed for this system model.Specifically,load optimization was taken as the core goal,and the load balancing among edge servers was realized under the premise of low delay and consumption.According to the location information of the roadside unit,the clustering algorithm Canopy was used to calculate the initial number of clusters.The initial division of the roadside unit was obtained using fuzzy C-means,and the roadside unit attribution priority matrix was output.Through the reinforcement learning,the optimal state of the roadside unit was obtained and the cluster center was calculated as the deployment location of the edge server.Comparative experiments verified that SRL had achieved a high degree of load balancing between edge servers under the premise of low service delay and consumption,which demonstrated the superiority of SRL.

Key words: edge computing, load balance, fuzzy C-means, reinforcement learning

中图分类号:

TP393

严翰致, 许小龙, 代飞, 齐连永, 窦万春, 李彤. 车联网环境下基于强化学习的边缘服务器部署策略[J]. 计算机集成制造系统, 2022, 28(10): 3146-3155.

YAN Hanzhi, XU Xiaolong, DAI Fei, QI Lianyong, DOU Wanchun, LI Tong. Edge server deployment strategy with reinforcement learning in Internet of vehicles[J]. Computer Integrated Manufacturing System, 2022, 28(10): 3146-3155.

[1]	黄子钊, 庄子龙, 滕浩, 秦威, 秦涛, 邹鹰. 自动化码头出口箱箱位分配优化超启发式算法[J]. 计算机集成制造系统, 2022, 28(8): 2619-2632.
[2]	杨琪森, 王慎执, 桑金楠, 王朝飞, 黄高, 吴澄, 宋士吉. 复杂开放水域下智能船舶路径规划与避障方法[J]. 计算机集成制造系统, 2022, 28(7): 2030-2040.
[3]	崔建双, 吕玥, 徐子涵. 基于Q—学习的超启发式模型及算法求解多模式资源约束项目调度问题[J]. 计算机集成制造系统, 2022, 28(5): 1472-1481.
[4]	高鹏, 苏雍贺, 左颖, 陶飞. 基于强化学习的分布式光伏运维资源动态调度[J]. 计算机集成制造系统, 2022, 28(2): 552-563.
[5]	周晓婷, 吴禄彬, 章宇, 姜善成. 基于不确定需求的无人驾驶出租车优化调度[J]. 计算机集成制造系统, 2022, 28(11): 3433-3442.
[6]	陶永, 兰江波, 任帆, 王田苗, 江山, 高赫, 温宇方. 基于自适应模糊神经网络的机器人焊接焊缝外形预测方法[J]. 计算机集成制造系统, 2022, 28(11): 3643-3651.
[7]	陈乔鑫, 卢宇, 林兵, 王素云, 邵浚. 车载边缘计算中推理任务的实时调度策略[J]. 计算机集成制造系统, 2022, 28(10): 3295-3303.
[8]	刘国志, 代飞, 莫启, 许小龙, 强振平, 王雷光. 车辆边缘计算环境下基于深度强化学习的服务卸载方法[J]. 计算机集成制造系统, 2022, 28(10): 3304-3315.
[9]	王誓伟,徐晓斌,梁中军. 基于城市计算的分布式异常数据分级过滤算法[J]. 计算机集成制造系统, 2021, 27(9): 2525-2531.
[10]	周博文,黄海军,徐怡,李学俊,高寒,陈天翔,刘晓,徐佳. 无人机配送系统中端边协同的并行任务调度算法[J]. 计算机集成制造系统, 2021, 27(9): 2575-2582.
[11]	李炜,蒋越,闵江松,张以文,王庆人. 边缘计算环境下自适应移动路径感知的用户分配算法[J]. 计算机集成制造系统, 2021, 27(9): 2592-2603.
[12]	刘庆祥,许小龙,张旭云,窦万春. 基于联邦学习的边缘智能协同计算与隐私保护方法[J]. 计算机集成制造系统, 2021, 27(9): 2604-2610.
[13]	冯春,张祎伟,黄成,姜文彪,武之炜. 双足机器人步态控制的深度强化学习方法[J]. 计算机集成制造系统, 2021, 27(8): 2341-2349.
[14]	袁友伟,黄锡恺,俞东进,李忠金. 移动边缘计算环境下服务工作流容错调度算法[J]. 计算机集成制造系统, 2021, 27(6): 1693-1702.
[15]	马靖,王译晨,赵明,蒋增强,鄂明成,王强. 基于数字孪生的生产单元可视化管控[J]. 计算机集成制造系统, 2021, 27(5): 1256-1268.

车联网环境下基于强化学习的边缘服务器部署策略

Edge server deployment strategy with reinforcement learning in Internet of vehicles

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics