Value Distribution DDPG With Dual-Prioritized Experience Replay for Coordinated Control of Coal-Fired Power Generation Systems

Xiaomin Liu, Mengjun Yu,Chunyu Yang,Linna Zhou,Haoyu Wang, Huaichun Zhou

IEEE Transactions on Industrial Informatics(2024)

引用 0|浏览0
暂无评分
摘要
The grid connection of renewable energy poses challenges to the coordinated control of coal-fired power generation systems. Model uncertainty makes model-driven methods less effective due to the lack of adaptive capability. Large inertia of thermal process leads to local aggregation of state information, and the direct grafting reinforcement learning methods will affect the learning efficiency due to insufficient data utilization. To this end, this article proposes dual-prioritized experience replay value distribution deep deterministic policy gradient (DPER-VDP3G) algorithm. Value distribution is introduced to reflect the influence of model uncertainty on the evaluation of coordinated control policy, thus improving the accuracy of prediction cost function. The DPER is designed to reduce the nonuniform sampling bias and remove redundant data to enhance sample diversity. Comparative experiments demonstrate the advantages of the proposed method for improving network training efficiency, ameliorating load tracking accuracy and speed, and reducing energy consumption.
更多
查看译文
关键词
Coal-fired power generation systems,coordinated control,deep deterministic policy gradient (DDPG),dual-prioritized experience replay (DPER),value distribution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要