Abstract
This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.
Original language | English |
---|---|
Pages (from-to) | 2985-3000 |
Number of pages | 16 |
Journal | IEEE Transactions on Power Electronics |
Volume | 38 |
Issue number | 3 |
DOIs | |
Publication status | Published - Mar 1 2023 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 1986-2012 IEEE.
ASJC Scopus Subject Areas
- Electrical and Electronic Engineering
Keywords
- input voltage sharing (IVS)
- Input-series output-parallel-connected dual active bridge (ISOP-DAB) converter
- multiagent twin-delayed deep deterministic policy gradient (MA-TD3)
- triple phase shift modulation