首站-论文投稿智能助手
典型文献
Multi-agent reinforcement learning for edge information sharing in vehicular networks
文献摘要:
To guarantee the heterogeneous delay requirements of the diverse vehicular services,it is necessary to design a full cooperative policy for both Vehicle to Infrastructure(V2I)and Vehicle to Vehicle(V2V)links.This paper investigates the reduction of the delay in edge information sharing for V2V links while satisfying the delay re-quirements of the V2I links.Specifically,a mean delay minimization problem and a maximum individual delay minimization problem are formulated to improve the global network performance and ensure the fairness of a single user,respectively.A multi-agent reinforcement learning framework is designed to solve these two prob-lems,where a new reward function is proposed to evaluate the utilities of the two optimization objectives in a unified framework.Thereafter,a proximal policy optimization approach is proposed to enable each V2V user to learn its policy using the shared global network reward.The effectiveness of the proposed approach is finally validated by comparing the obtained results with those of the other baseline approaches through extensive simulation experiments.
文献关键词:
作者姓名:
Ruyan Wang;Xue Jiang;Yujie Zhou;Zhidu Li;Dapeng Wu;Tong Tang;Alexander Fedotov;Vladimir Badenko
作者机构:
School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications,Chongqing,400065,China;Advanced Network and Intelligent Connection Technology Key Laboratory of Chongqing Education Commission of China,Chongqing,400065,China;Key Laboratory of Ubiquitous Sensing and Networking in Chongqing,Chongqing,400065,China;Peter the Great St.Petersburg Polytechnic University,Polytechnicheskaya,29,St.Petersburg,195251,Russia
引用格式:
[1]Ruyan Wang;Xue Jiang;Yujie Zhou;Zhidu Li;Dapeng Wu;Tong Tang;Alexander Fedotov;Vladimir Badenko-.Multi-agent reinforcement learning for edge information sharing in vehicular networks)[J].数字通信与网络(英文),2022(03):267-277
A类:
B类:
Multi,agent,reinforcement,learning,edge,information,sharing,vehicular,networks,To,guarantee,heterogeneous,delay,requirements,diverse,services,necessary,full,cooperative,policy,both,Vehicle,Infrastructure,V2I,V2V,links,This,paper,investigates,reduction,while,satisfying,Specifically,mean,minimization,problem,maximum,individual,formulated,improve,global,performance,ensure,fairness,single,user,respectively,multi,framework,designed,solve,these,lems,where,new,reward,function,proposed,evaluate,utilities,optimization,objectives,unified,Thereafter,proximal,enable,each,its,using,shared,effectiveness,finally,validated,by,comparing,obtained,results,those,other,baseline,approaches,through,extensive,simulation,experiments
AB值:
0.564443
相似文献
Real-time energy optimization of HEVs under-connected environment:a benchmark problem and receding horizon-based solution
Fuguo Xu;Hiroki Tsunogawa;Junichi Kako;Xiaosong Hu;Shengbo Eben Li;Tielong Shen;Lars Eriksson;Carlos Guardiola-Department of Engineering and Applied Sciences,Sophia University,Tokyo 102-8554,Japan;Higashi-Fuji Research Center,Toyota Motor Corporation,Shizuoka 410-1107,Japan;The State Key Laboratory of Mechanical Transmissions,Chongqing University,Chongqing 400044,China;The State Key Laboratory of Automotive Safety and Energy,Tsinghua University,Beijing 100084,China;Vehicular Systems,Department Electrical Engineering,Link?ping University,SE 581 83,Link?ping,Sweden;Departamento de Máquinas y Motores Térmicos,Universitat Politècnica de València,Camino de Vera s/n,46022 Valencia,Spain
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。