Control Method for PEMFC Using Improved Deep Deterministic Policy Gradient Algorithm

2021, Vol 9
Author(s): Jiawen Li, Yaping Li, Tao Yu

A data-driven output voltage control method for the proton exchange membrane fuel cell (PEMFC) is proposed, together with an improved deep deterministic policy gradient (DDPG) algorithm to implement it. The algorithm introduces three techniques to improve the robustness of the control policy: clipped multiple Q-learning, delayed policy updates, and policy smoothing. The hydrogen controller is treated as an agent that is pre-trained by fully interacting with the environment to obtain the optimal control policy. The effectiveness of the proposed algorithm is demonstrated experimentally.
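The three robustness techniques named in the abstract are those popularized by TD3. Below is a minimal sketch of a critic/actor update that combines them, assuming a PyTorch implementation; the network sizes, hyperparameters, and PEMFC state/action dimensions are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn

STATE_DIM, ACTION_DIM = 4, 1          # hypothetical PEMFC state/action sizes
GAMMA, TAU = 0.99, 0.005              # discount factor, Polyak averaging rate
POLICY_NOISE, NOISE_CLIP = 0.2, 0.5   # target policy smoothing parameters
POLICY_DELAY = 2                      # actor updated once per 2 critic updates

def mlp(i, o):
    return nn.Sequential(nn.Linear(i, 64), nn.ReLU(), nn.Linear(64, o))

actor, actor_targ = mlp(STATE_DIM, ACTION_DIM), mlp(STATE_DIM, ACTION_DIM)
actor_targ.load_state_dict(actor.state_dict())
critics = nn.ModuleList(mlp(STATE_DIM + ACTION_DIM, 1) for _ in range(2))
critics_targ = nn.ModuleList(mlp(STATE_DIM + ACTION_DIM, 1) for _ in range(2))
critics_targ.load_state_dict(critics.state_dict())
q_opt = torch.optim.Adam(critics.parameters(), lr=1e-3)
pi_opt = torch.optim.Adam(actor.parameters(), lr=1e-3)

def update(s, a, r, s2, done, step):
    with torch.no_grad():
        # Policy smoothing: perturb the target action with clipped noise.
        noise = (torch.randn_like(a) * POLICY_NOISE).clamp(-NOISE_CLIP, NOISE_CLIP)
        a2 = (actor_targ(s2) + noise).clamp(-1.0, 1.0)
        # Clipped Q-learning: pessimistic minimum over the target critics.
        q_next = torch.min(critics_targ[0](torch.cat([s2, a2], -1)),
                           critics_targ[1](torch.cat([s2, a2], -1)))
        y = r + GAMMA * (1.0 - done) * q_next
    # Both critics regress toward the same clipped target.
    q_loss = sum(((c(torch.cat([s, a], -1)) - y) ** 2).mean() for c in critics)
    q_opt.zero_grad(); q_loss.backward(); q_opt.step()
    if step % POLICY_DELAY == 0:      # delayed policy update
        pi_loss = -critics[0](torch.cat([s, actor(s)], -1)).mean()
        pi_opt.zero_grad(); pi_loss.backward(); pi_opt.step()
        # Slow Polyak tracking of the target networks.
        for net, targ in ((actor, actor_targ), (critics, critics_targ)):
            for p, pt in zip(net.parameters(), targ.parameters()):
                pt.data.mul_(1.0 - TAU).add_(TAU * p.data)
```

The pessimistic minimum counters overestimation of Q-values, while the delayed actor update and target smoothing keep the policy from exploiting transient errors in the critics.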

2021, Vol 9
Author(s): Jiawen Li, Yaping Li, Tao Yu

To improve the stability of proton exchange membrane fuel cell (PEMFC) output voltage, this paper proposes a data-driven output voltage control strategy based on regulating the duty cycle of the DC-DC converter. Specifically, an imitation-oriented twin delayed deep deterministic (IO-TD3) policy gradient algorithm is presented, which yields a more robust voltage control strategy. The proposed method is a distributed deep reinforcement learning training framework whose design is guided by the pedagogical concept of imitation learning. The effectiveness of the proposed control strategy is demonstrated experimentally.
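The abstract does not spell out how imitation enters the training loop; one common way to make an actor-critic update "imitation-oriented" is to add a behavior-cloning term that pulls the actor toward a conventional demonstrator controller. The sketch below illustrates that idea on the TD3 actor loss; the PI demonstrator, its gains, and the weight LAMBDA_BC are illustrative assumptions, not details from the paper.

```python
import torch

LAMBDA_BC = 0.5        # hypothetical weight on the imitation term
KP, KI = 0.05, 0.01    # hypothetical PI gains of the demonstrator controller

def demonstrator(v_err, v_err_int):
    """Hypothetical PI duty-cycle controller used as the imitation target."""
    return (KP * v_err + KI * v_err_int).clamp(-1.0, 1.0)

def actor_loss(actor, critic, s, v_err, v_err_int):
    a = actor(s)                                     # proposed duty-cycle action
    rl_term = -critic(torch.cat([s, a], -1)).mean()  # usual TD3 actor objective
    bc_term = ((a - demonstrator(v_err, v_err_int)) ** 2).mean()
    return rl_term + LAMBDA_BC * bc_term             # RL objective + imitation
```

In a distributed framework, each worker could evaluate this loss on its own transitions while sharing the actor parameters; that allocation is likewise an assumption here.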


1991, Vol 111 (5), pp. 117-126
Author(s): Akio Ishiguro, Takeshi Furuhashi, Shigeru Okuma, Yoshiki Uchikawa, Muneaki Ishida

Processes, 2020, Vol 8 (3), pp. 368
Author(s): Jian Chen, Jinhua Wang, Jie Huang

In this paper, the Q-learning method for the quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly because the data sets are linearly correlated. The following corollaries are drawn: (1) data correlation is the key factor in whether the Q-learning method can successfully compute quadratic optimal control laws; (2) control laws for linear systems cannot be derived directly by the existing Q-learning method; (3) for nonlinear systems, the data independence assumed by the current method is also in doubt. It is therefore necessary to examine the validity of controllers established by the existing Q-learning method. To address this problem, an improved model-free Q-learning quadratic optimal control method for discrete-time linear systems, based on ridge regression, is proposed in this paper. With ridge regression, the computation can be carried out correctly and an effective controller can be obtained. Simulation results show that the proposed method not only overcomes the problem caused by data correlation but also derives proper control laws for discrete-time linear systems.
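A minimal sketch of the idea, under illustrative assumptions: policy-iteration Q-learning for a discrete-time LQR problem, where the least-squares step for the quadratic Q-function parameters is replaced by ridge-regularized normal equations so that correlated (near-collinear) data no longer makes the solve ill-conditioned. The system matrices, cost weights, exploration noise, and ridge parameter are hypothetical, not the paper's example.

```python
import numpy as np

A = np.array([[0.9, 0.1], [0.0, 0.8]])   # hypothetical stable plant x+ = Ax + Bu
B = np.array([[0.0], [0.1]])
Qc, Rc = np.eye(2), np.eye(1)            # quadratic stage cost x'Qx + u'Ru
n, m = 2, 1
LAM = 1e-6                               # ridge regularization strength

def phi(x, u):
    """Quadratic features of z = [x; u]: packed upper triangle of z z'."""
    z = np.concatenate([x, u])
    idx = np.triu_indices(n + m)
    w = np.where(idx[0] == idx[1], 1.0, 2.0)  # off-diagonals count twice in z'Hz
    return w * np.outer(z, z)[idx]

rng = np.random.default_rng(0)
K = np.zeros((m, n))                     # initial policy u = -Kx (plant is stable)
for _ in range(10):                      # policy iteration
    rows, costs = [], []
    x = rng.standard_normal(n)
    for _ in range(200):                 # collect exploratory closed-loop data
        u = -K @ x + 0.5 * rng.standard_normal(m)
        c = x @ Qc @ x + u @ Rc @ u
        x2 = A @ x + B @ u
        # Bellman identity for Q^K: phi(x,u)'th - phi(x2,-Kx2)'th = c
        rows.append(phi(x, u) - phi(x2, -K @ x2))
        costs.append(c)
        x = x2
    Phi, c_vec = np.array(rows), np.array(costs)
    # Ridge regression: solve (Phi'Phi + LAM*I) th = Phi'c
    th = np.linalg.solve(Phi.T @ Phi + LAM * np.eye(Phi.shape[1]), Phi.T @ c_vec)
    H = np.zeros((n + m, n + m))         # unpack the symmetric Q-function matrix
    H[np.triu_indices(n + m)] = th
    H = H + H.T - np.diag(np.diag(H))
    K = np.linalg.solve(H[n:, n:], H[n:, :n])  # policy improvement: u = -Kx

print("learned gain K:", K)              # approaches the LQR gain for (A, B, Qc, Rc)
```

Without the LAM term, Phi'Phi can become singular when exploration is weak and the data rows are linearly dependent; the ridge term keeps the solve well-posed, which is the abstract's central point, at the cost of a small, controllable bias in the estimated Q-function parameters.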

