Representative literature
Hierarchical reinforcement learning guidance with threat avoidance
Abstract:
The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation. A novel guidance law is presented by exploiting the deep reinforcement learning (DRL) with the hierarchical deep deterministic policy gradient (DDPG) algorithm. The reward functions are constructed to minimize the line-of-sight (LOS) angle rate and avoid the threat caused by the opposed obstacles. To attenuate the chattering of the acceleration, a hierarchical reinforcement learning structure and an improved reward function with action penalty are put forward. The simulation results validate that the missile under the proposed method can hit the target successfully and keep away from the threatened areas effectively.
Authors:
LI Bohao; WU Yunjie; LI Guofei
Author affiliations:
State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China; School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China; Science and Technology on Aircraft Control Laboratory, Beijing 100191, China; School of Astronautics, Northwestern Polytechnical University, Xi'an 710072, China
Citation format:
[1] LI Bohao; WU Yunjie; LI Guofei. Hierarchical reinforcement learning guidance with threat avoidance[J]. Journal of Systems Engineering and Electronics, 2022(05): 1173-1185