Multi-agent reinforcement learning for edge information sharing in vehicular networks|Ruyan Wang;Xue Jiang;Yujie Zhou;Zhidu Li;Dapeng Wu;Tong Tang;Alexander Fedotov;Vladimir Badenko|Advanced Network and Intelligent Connection Technology Key Laboratory of Chongqing Education Commission of China,Chongqing,400065,China - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Multi-agent reinforcement learning for edge information sharing in vehicular networks

文献摘要：

To guarantee the heterogeneous delay requirements of the diverse vehicular services,it is necessary to design a full cooperative policy for both Vehicle to Infrastructure(V2I)and Vehicle to Vehicle(V2V)links.This paper investigates the reduction of the delay in edge information sharing for V2V links while satisfying the delay re-quirements of the V2I links.Specifically,a mean delay minimization problem and a maximum individual delay minimization problem are formulated to improve the global network performance and ensure the fairness of a single user,respectively.A multi-agent reinforcement learning framework is designed to solve these two prob-lems,where a new reward function is proposed to evaluate the utilities of the two optimization objectives in a unified framework.Thereafter,a proximal policy optimization approach is proposed to enable each V2V user to learn its policy using the shared global network reward.The effectiveness of the proposed approach is finally validated by comparing the obtained results with those of the other baseline approaches through extensive simulation experiments.

文献关键词：

中图分类号：

[1] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[2] 自动化技术、计算机技术（TP） / 自动化基础理论（TP1） / 人工智能理论（TP18）

[3] 无线电电子学、电信技术（TN） / 无线通信（TN92） / 波导通信、毫米波通信（TN928） / 移动通信（TN929.5）

作者姓名：

Ruyan Wang;Xue Jiang;Yujie Zhou;Zhidu Li;Dapeng Wu;Tong Tang;Alexander Fedotov;Vladimir Badenko

作者机构：

School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications,Chongqing,400065,China;Advanced Network and Intelligent Connection Technology Key Laboratory of Chongqing Education Commission of China,Chongqing,400065,China;Key Laboratory of Ubiquitous Sensing and Networking in Chongqing,Chongqing,400065,China;Peter the Great St.Petersburg Polytechnic University,Polytechnicheskaya,29,St.Petersburg,195251,Russia

文献出处：

数字通信与网络（英文）

引用格式：

[1]Ruyan Wang;Xue Jiang;Yujie Zhou;Zhidu Li;Dapeng Wu;Tong Tang;Alexander Fedotov;Vladimir Badenko-.Multi-agent reinforcement learning for edge information sharing in vehicular networks)[J].数字通信与网络（英文）,2022(03):267-277

A类：

B类：

Multi,agent,reinforcement,learning,edge,information,sharing,vehicular,networks,To,guarantee,heterogeneous,delay,requirements,diverse,services,necessary,full,cooperative,policy,both,Vehicle,Infrastructure,V2I,V2V,links,This,paper,investigates,reduction,while,satisfying,Specifically,mean,minimization,problem,maximum,individual,formulated,improve,global,performance,ensure,fairness,single,user,respectively,multi,framework,designed,solve,these,lems,where,new,reward,function,proposed,evaluate,utilities,optimization,objectives,unified,Thereafter,proximal,enable,each,its,using,shared,effectiveness,finally,validated,by,comparing,obtained,results,those,other,baseline,approaches,through,extensive,simulation,experiments

AB值：

0.564443

相似文献

Combat network link prediction based on embedding learning

SUN Jianbin;LI Jichao;YOU Yaqian;JIANG Jiang;Ge Bingfeng-College of Systems Engineering,National University of Defense Technology,Changsha 410073,China

Choice of discount rate in reinforcement learning with long-delay rewards

LIN Xiangyang;XING Qinghua;LIU Fuxian-Department of Air Defense and Anti-Missile,Air Force Engineering University,Xi'an 710051,China

Knowledge transfer in multi-agent reinforcement learning with incremental number of agents

LIU Wenzhang;DONG Lu;LIU Jian;SUN Changyin-School of Automation,Southeast University,Nanjing 210096,China;School of Cyber Science and Engineering,Southeast University,Nanjing 211189,China

Day-ahead scheduling based on reinforcement learning with hybrid action space

CAO Jingyu;DONG Lu;SUN Changyin-School of Automation, Southeast University, Nanjing 210096, China;School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China

A deep reinforcement learning method for multi-stage equip-ment development planning in uncertain environments