计算机集成制造系统 ›› 2022, Vol. 28 ›› Issue (10): 3146-3155.DOI: 10.13196/j.cims.2022.10.011

• • 上一篇    下一篇

车联网环境下基于强化学习的边缘服务器部署策略

严翰致1,许小龙1,4+,代飞2,齐连永3,窦万春4,李彤5   

  1. 1.南京信息工程大学计算机与软件学院
    2.西南林业大学大数据与智能工程学院
    3.曲阜师范大学信息科学与工程学院
    4.南京大学计算机软件新技术国家重点实验室
    5.云南农业大学大数据学院
  • 出版日期:2022-10-31 发布日期:2022-11-10
  • 基金资助:
    国家重点研发计划资助项目(2020YFB1707600);新疆生产建设兵团财政科技支撑计划资助项目(2020DB005)。

Edge server deployment strategy with reinforcement learning in Internet of vehicles

YAN Hanzhi1,XU Xiaolong1,4+,DAI Fei2,QI Lianyong3,DOU Wanchun4,LI Tong5   

  1. 1.School of Computer and Software,Nanjing University of Information Science and Technology
    2.College of Big Data and Intelligent Engineering,Southwest Forestry University
    3.School of Information Science and Engineering,Qufu Normal University
    4.State Key Laboratory for Novel Software Technology,Nanjing University
    5.College of Big Data,Yunnan Agricultural University
  • Online:2022-10-31 Published:2022-11-10
  • Supported by:
    Project supported by the National Key Research and Development Program,China(No.2020YFB1707600),and the Financial Science & Technology Supporting Plan of Xinjiang Production and Construction Corps,China(No.2020DB005).

摘要: 鉴于现有的边缘服务器部署策略主要用于改善5G、无线城域网等场景下的服务性能,无法直接用于车联网服务部署,提出一种边云协同的5G车联网边缘计算系统模型,针对该系统模型设计了基于强化学习的边缘服务器部署策略,其以负载优化为核心目标,在保证低延迟和低能耗前提下实现边缘服务器间的负载均衡。根据路边单元位置信息用Canopy聚类获取初始的聚簇数,用模糊C均值聚类获取路边单元的初始划分,并输出路边单元归属优先级矩阵;通过强化学习获得路边单元归属的最优状态并计算聚簇中心作为边缘服务器部署位置。通过对比实验验证了该策略在低服务延迟和低能耗下,能够高度实现边缘服务器间的负载均衡,表明该策略具有优越性。

关键词: 边缘计算, 负载均衡, 模糊C均值, 强化学习

Abstract: The existing edge server placement methods are mainly used to improve service performance in scenarios such as 5G and wireless metropolitan area networks,but cannot be directly used for the deployment of Internet of Vehicles (IoV) services.Therefore,an edge computing system model with edge-cloud collaboration for 5G-IoV was proposed,and a deployment Strategy of edge servers based on Reinforcement Learning (SRL) was designed for this system model.Specifically,load optimization was taken as the core goal,and the load balancing among edge servers was realized under the premise of low delay and consumption.According to the location information of the roadside unit,the clustering algorithm Canopy was used to calculate the initial number of clusters.The initial division of the roadside unit was obtained using fuzzy C-means,and the roadside unit attribution priority matrix was output.Through the reinforcement learning,the optimal state of the roadside unit was obtained and the cluster center was calculated as the deployment location of the edge server.Comparative experiments verified that SRL had achieved a high degree of load balancing between edge servers under the premise of low service delay and consumption,which demonstrated the superiority of SRL.

Key words: edge computing, load balance, fuzzy C-means, reinforcement learning

中图分类号: