Reinforcement Learning Behavioral Control for Nonlinear Autonomous System|Zhenyi Zhang;Zhibin Mo;Yutao Chen;Jie Huang|Key Laboratory of Industrial Automation Control Technology and Information Processing,Education Department of Fujian Province,Fuzhou 350108 - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Reinforcement Learning Behavioral Control for Nonlinear Autonomous System

文献摘要：

Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers.In this work,a novel two-layer reinforcement learning behavioral control(RLBC)method is proposed to reduce such dependence by trial-and-error learning.Specifically,in the upper layer,a reinforcement learning mission supervisor(RLMS)is designed to learn the optimal mission priority.Compared with existing mission supervisors,the RLMS improves the dynamic performance of mission priority adjustment by maximizing cumulative rewards and reducing hardware storage demand when using neural networks.In the lower layer,a reinforcement learning controller(RLC)is designed to learn the optimal control policy.Compared with existing behavioral controllers,the RLC reduces the control cost of mission priority adjustment by balancing control performance and consumption.All error signals are proved to be semi-globally uniformly ultimately bounded(SGUUB).Simulation results show that the number of mission priority adjustment and the control cost are significantly reduced compared to some existing mission supervisors and behavioral controllers,respectively.

文献关键词：

中图分类号：

[1] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[2] 自动化技术、计算机技术（TP） / 自动化基础理论（TP1） / 人工智能理论（TP18） / 自动推理、机器学习（TP181）

[3] 天文学、地球科学（P） / 地球物理学（P3） / 空间物理（P35）

作者姓名：

Zhenyi Zhang;Zhibin Mo;Yutao Chen;Jie Huang

作者机构：

College of Electrical Engineering and Automation,Fuzhou University,Fuzhou 350108;Key Laboratory of Industrial Automation Control Technology and Information Processing,Education Department of Fujian Province,Fuzhou 350108;G+Industrial Internet Institute,Fuzhou University,Fuzhou 350108,China

文献出处：

自动化学报（英文版）

引用格式：

[1]Zhenyi Zhang;Zhibin Mo;Yutao Chen;Jie Huang-.Reinforcement Learning Behavioral Control for Nonlinear Autonomous System)[J].自动化学报（英文版）,2022(09):1561-1573

A类：

RLBC,SGUUB

B类：

Reinforcement,Learning,Behavioral,Control,Nonlinear,Autonomous,System,autonomous,systems,rely,human,intelligence,resolve,multi,mission,conflicts,by,designing,priority,rules,nonlinear,controllers,In,this,novel,layer,reinforcement,learning,behavioral,method,proposed,such,dependence,trial,error,Specifically,upper,RLMS,designed,optimal,Compared,existing,supervisors,improves,dynamic,performance,adjustment,maximizing,cumulative,rewards,reducing,hardware,storage,demand,when,using,neural,networks,lower,RLC,policy,reduces,cost,balancing,consumption,All,signals,proved,semi,globally,uniformly,ultimately,bounded,Simulation,results,show,that,number,significantly,reduced,compared,some,respectively

AB值：

0.516452

相似文献

Joint Scheduling and Resource Allocation for Federated Learning in SWIPT-Enabled Micro UAV Swarm Networks

Wanli Wen;Yunjian Jia;Wenchao Xia-School of Microelectronics and Communication Engineering,Chongqing University,Chongqing 400044,China;National Mobile Communications Research Laboratory,Southeast University,Nanjing 210009,China;Jiangsu Key Laboratory of Wireless Communications,Nanjing University of Posts and Telecommunications,Nanjing 210003,China

Cloud-Assisted Distributed Edge Brains for Multi-Cell Joint Beamforming Optimization for 6G

Juan Deng;Kaicong Tian;Qingbi Zheng;Jielin Bai;Kuo Cui;Yitong Liu;Guangyi Liu-Future Research Lab,China Mobile Research Institute,Beijing 100032,China;Beijing University of Posts and Telecommunications,Beijing 100876,China

Transmit Diversity Scheme Design for Rectangular Pulse Shaping Based OTFS

Dong Wang;Bule Sun;Fanggang Wang;Xiran Li;Pu Yuan;Dajie Jiang-State Key Laboratory of Rail Traffic Control and Safety,Beijing Jiaotong University,Beijing 00044,China;vivo Mobile Communication Co.,Ltd.,Beijing 100016,China

Multi-Agent Few-Shot Meta Reinforcement Learning for Trajectory Design and Channel Selection in UAV-Assisted Networks

Shiyang Zhou;Yufan Cheng;Xia Lei;Huanhuan Duan-National Key Laboratory of Science and Technology on Communications,University of Electronic Science and Technology of China,Chengdu 611731,China

Linear Network Coding Based Fast Data Synchronization for Wireless Ad Hoc Networks with Controlled Topology