Day-ahead scheduling based on reinforcement learning with hybrid action space|CAO Jingyu;DONG Lu;SUN Changyin|School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Day-ahead scheduling based on reinforcement learning with hybrid action space

文献摘要：

Driven by the improvement of the smart grid, the active distribution network (ADN) has attracted much attention due to its characteristic of active management. By making full use of electricity price signals for optimal scheduling, the total cost of the ADN can be reduced. However, the optimal day-ahead scheduling problem is challenging since the future electri-city price is unknown. Moreover, in ADN, some schedulable vari-ables are continuous while some schedulable variables are dis-crete, which increases the difficulty of determining the optimal scheduling scheme. In this paper, the day-ahead scheduling problem of the ADN is formulated as a Markov decision process (MDP) with continuous-discrete hybrid action space. Then, an algorithm based on multi-agent hybrid reinforcement learning (HRL) is proposed to obtain the optimal scheduling scheme. The proposed algorithm adopts the structure of centralized training and decentralized execution, and different methods are applied to determine the selection policy of continuous scheduling vari-ables and discrete scheduling variables. The simulation experi-ment results demonstrate the effectiveness of the algorithm.

文献关键词：

中图分类号：

[1] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[2] 自动化技术、计算机技术（TP） / 自动化基础理论（TP1） / 人工智能理论（TP18）

[3] 数理科学和化学（O） / 力学（O3） / 振动理论（O32） / 非线性振动（O322）

作者姓名：

CAO Jingyu;DONG Lu;SUN Changyin

作者机构：

School of Automation, Southeast University, Nanjing 210096, China;School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China

文献出处：

系统工程与电子技术（英文版）

引用格式：

[1]CAO Jingyu;DONG Lu;SUN Changyin-.Day-ahead scheduling based on reinforcement learning with hybrid action space)[J].系统工程与电子技术（英文版）,2022(03):693-705

A类：

schedulable

B类：

Day,ahead,scheduling,reinforcement,learning,hybrid,action,space,Driven,by,improvement,smart,grid,active,distribution,network,ADN,has,attracted,much,attention,due,its,characteristic,management,By,making,full,use,electricity,price,signals,optimal,total,cost,can,be,reduced,However,day,problem,challenging,since,future,unknown,Moreover,some,are,continuous,while,variables,which,increases,difficulty,determining,scheme,In,this,paper,formulated,Markov,decision,process,MDP,discrete,Then,algorithm,multi,agent,HRL,proposed,obtain,adopts,structure,training,decentralized,execution,different,methods,applied,determine,selection,policy,simulation,experi,results,demonstrate,effectiveness

AB值：

0.513473

相似文献

Towards autonomous and optimal excavation of shield machine:a deep reinforcement learning-based approach

Ya-kun ZHANG;Guo-fang GONG;Hua-yong YANG;Yu-xi CHEN;Geng-lin CHEN-State Key Laboratory of Fluid Power and Mechatronic Systems,Zhejiang University,Hangzhou 310027,China;School of Electrical and Power Engineering,China University of Mining and Technology,Xuzhou 221116,China

Minimax Q-learning design for H∞ control of linear discrete-time systems

Xinxing LI;Lele XI;Wenzhong ZHA;Zhihong PENG-Information Science Academy,China Electronics Technology Group Corporation,Beijing 100086,China;School of Automation,Beijing Institute of Technology,Beijing 100081,China;Peng Cheng Laboratory,Shenzhen 518052,China

Multi-agent differential game based cooperative synchronization control using a data-driven method

Yu SHI;Yongzhao HUA;Jianglong YU;Xiwang DONG;Zhang REN-School of Automation Science and Electrical Engineering,Beihang University,Beijing 100191,China;Institute of Artificial Intelligence,Beihang University,Beijing 100191,China

Behavioral control task supervisor with memory based on reinforcement learning for human-multi-robot coordination systems

Jie HUANG;Zhibin MO;Zhenyi ZHANG;Yutao CHEN-School of Electrical Engineering and Automation,Fuzhou University,Fuzhou 350108,China;G+Industrial Internet Institute,Fuzhou University,Fuzhou 350108,China;Key Laboratory of Industrial Automation Control Technology and Information Processing of Fujian Province,Fuzhou University,Fuzhou 350108,China

Training time minimization for federated edge learning with optimized gradient quantization and bandwidth allocation