Research on virtual entity decision model for LVC tactical confrontation of army units|GAO Ang;GUO Qisheng;DONG Zhiming;TANG Zaijiang;ZHANG Ziwei;FENG Qiqi - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Research on virtual entity decision model for LVC tactical confrontation of army units

文献摘要：

According to the requirements of the live-virtual-con-structive(LVC)tactical confrontation(TC)on the virtual entity(VE)decision model of graded combat capability,diversified actions,real-time decision-making,and generalization for the enemy,the confrontation process is modeled as a zero-sum stochastic game(ZSG).By introducing the theory of dynamic re-lative power potential field,the problem of reward sparsity in the model can be solved.By reward shaping,the problem of credit assignment between agents can be solved.Based on the idea of meta-learning,an extensible multi-agent deep reinforcement learning(EMADRL)framework and solving method is proposed to improve the effectiveness and efficiency of model solving.Experiments show that the model meets the requirements well and the algorithm learning efficiency is high.

文献关键词：

中图分类号：

[1] 医药、卫生（R） / 基础医学（R3） / 病理学（R36） / 病理过程（R364）

[2] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[3] 医药、卫生（R） / 药学（R9） / 药理学（R96） / 实验药理学（R965）

作者姓名：

GAO Ang;GUO Qisheng;DONG Zhiming;TANG Zaijiang;ZHANG Ziwei;FENG Qiqi

作者机构：

Military Exercise and Training Center,Army Academy of Armored Forces,Beijing 100072,China

文献出处：

系统工程与电子技术（英文版）

引用格式：

[1]GAO Ang;GUO Qisheng;DONG Zhiming;TANG Zaijiang;ZHANG Ziwei;FENG Qiqi-.Research on virtual entity decision model for LVC tactical confrontation of army units)[J].系统工程与电子技术（英文版）,2022(05):1249-1267

A类：

army,EMADRL

B类：

Research,virtual,entity,decision,LVC,tactical,confrontation,units,According,requirements,live,structive,VE,graded,combat,capability,diversified,actions,real,making,generalization,enemy,process,modeled,zero,sum,stochastic,game,ZSG,By,introducing,theory,dynamic,lative,power,potential,field,problem,reward,sparsity,can,solved,shaping,credit,assignment,between,agents,Based,idea,meta,learning,extensible,multi,deep,reinforcement,framework,solving,method,proposed,improve,effectiveness,efficiency,Experiments,show,that,meets,well,algorithm,high

AB值：

0.584486

相似文献

Towards autonomous and optimal excavation of shield machine:a deep reinforcement learning-based approach

Ya-kun ZHANG;Guo-fang GONG;Hua-yong YANG;Yu-xi CHEN;Geng-lin CHEN-State Key Laboratory of Fluid Power and Mechatronic Systems,Zhejiang University,Hangzhou 310027,China;School of Electrical and Power Engineering,China University of Mining and Technology,Xuzhou 221116,China

Low-loss belief propagation decoder with Tanner graph in quantum error-correction codes

Dan-Dan Yan;Xing-Kui Fan;Zhen-Yu Chen;Hong-Yang Ma-School of Sciences,Qingdao University of Technology,Qingdao 266033,China

A novel physics-informed framework for reconstruction of structural defects

Qi LI;Fushun LIU;Bin WANG;D.Z.LIU;Zhenghua QIAN-State Key Laboratory of Mechanics and Control of Mechanical Structures,College of Aerospace Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China;College of Engineering,Ocean University of China,Qingdao 266100,Shandong Province,China;School of Engineering,University of East Anglia,Norwich NR4 7TJ,U.K.

Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks

Xiaoyu LIU;Chi XU;Haibin YU;Peng ZENG-State Key Laboratory of Robotics,Shenyang Institute of Automation,Chinese Academy of Sciences,Shenyang 110016,China;Key Laboratory of Networked Control Systems,Chinese Academy of Sciences,Shenyang 110016,China;Institutes for Robotics and Intelligent Manufacturing,Chinese Academy of Sciences,Shenyang 110169,China;University of Chinese Academy of Sciences,Beijing 100049,China

Minimax Q-learning design for H∞ control of linear discrete-time systems