Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections|Xuemei Chen;Jiahe Liu;Zijia Wang;Xintong Han;Yufan Sun;Xuelong Zheng - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

文献摘要：

Behavioral decision-making at urban intersections is one of the primary difficulties cur-rently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban inter-sections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algo-rithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersec-tion scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the deci-sion models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.

文献关键词：

中图分类号：

[1] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[2] 医药、卫生（R） / 神经病学与精神病学（R74） / 神经病学（R741）

[3] 医药、卫生（R） / 基础医学（R3） / 病理学（R36） / 病理过程（R364）

作者姓名：

Xuemei Chen;Jiahe Liu;Zijia Wang;Xintong Han;Yufan Sun;Xuelong Zheng

作者机构：

School of Mechanical Engineering,Beijing Insti-tute of Technology,Beijing 100081,China

文献出处：

北京理工大学学报（英文版）

引用格式：

[1]Xuemei Chen;Jiahe Liu;Zijia Wang;Xintong Han;Yufan Sun;Xuelong Zheng-.Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections)[J].北京理工大学学报（英文版）,2022(04):327-339

A类：

Intersections,intersec

B类：

Decision,Making,Models,Based,Reinforcement,Learning,Intelligent,Vehicles,Urban,Behavioral,decision,making,urban,intersections,one,primary,difficulties,cur,rently,impeding,development,intelligent,technology,problem,that,existing,algorithms,cannot,effectively,deal,complex,random,scenarios,To,this,deep,deterministic,policy,gradient,DDPG,series,Markov,process,MDP,was,developed,where,extended,collect,observations,from,several,consecutive,frames,Experiments,found,performed,better,terms,convergence,generalizability,than,traditional,Furthermore,agnostic,meta,learning,MAML,incorporated,into,improve,training,method,leading,secondary,Simulation,experiments,were,carried,out,Gym,Carla,platform,verify,compare,models,results,showed,able,easily,states,which,could,traffic,safety,efficiency,above,reinforcement,significant,enhancing,vehicles

AB值：

0.54226

相似文献

Joint Trajectory and Passive Beamforming Optimization in IRS-UAV Enhanced Anti-Jamming Communication Networks

Zhifeng Hou;Jin Chen;Yuzhen Huang;Yijie Luo;Ximing Wang;Jiangchun Gu;Yifan Xu;Kailing Yao-PLA Army Engineering University,Nanjing 210007,China;Artificial Intelligence Research Center,Defense Innovation Institute,Beijing 100166,China;School of Information and Communication,Beijing University of Posts and Telecommunications,Beijing 100876,China;College of Information and Communication,National University of Defense Technology,Wuhan 430010,China

Stochastic Learning for Opportunistic Peer-to-Peer Computation Offloading in IoT Edge Computing

Siqi Mu;Yanfei Shen-School of Sports Engineering(China Big Data Center for Sports),Beijing Sport University,Beijing 100084,China

Trajectory Design for UAV-Enabled Maritime Secure Communications:A Reinforcement Learning Approach

Jintao Liu;Feng Zeng;Wei Wang;Zhichao Sheng;Xinchen Wei;Kanapathippillai Cumanan-School of Information Science and Technology,Nantong University,Nantong 226019,China;Nantong Research Institute for Advanced Communication Technologies,Nantong 226019,China;Key Laboratory of Specialty Fiber Optics and Optical Access Networks,Shanghai University,Shanghai 200444,China;Department of Electronic Engineering,University of York,York,YO105DD,United Kingdom

A Multi-Agent Reinforcement Learning-Based Collaborative Jamming System:Algorithm Design and Software-Defined Radio Implementation

Luguang Wang;Fei Song;Gui Fang;Zhibin Feng;Wen Li;Yifan Xu;Chen Pan;Xiaojing Chu-College of Communications Engineering,Army Engineering University of PLA,Nanjing 210000,China

Large-Scale Group Decision Making:A Systematic Review and a Critical Analysis