Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid: A Multiagent Deep Reinforcement Learning-Based Method

Yu Zeng; Josep Pou; Changjiang Sun; Suvajit Mukherjee; Xu Xu; Amit Kumar Gupta; Jiaxin Dong

doi:10.1109/TPEL.2022.3218900

Yu Zeng^*, Josep Pou, Changjiang Sun, Suvajit Mukherjee, Xu Xu, Amit Kumar Gupta, Jiaxin Dong

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

53 Citations (Scopus)

Abstract

This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.
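The abstract names the twin-delayed deep deterministic policy gradient (TD3) algorithm but gives no formulas. As a rough illustration only, the sketch below shows the two standard ingredients of a generic TD3 critic target: target policy smoothing and the clipped double-Q minimum. It is not the authors' implementation. The scalar action (which could stand for one phase-shift ratio), the function name `td3_target`, the hyperparameter defaults, and the toy critics are all assumptions made for this example; the paper's state, action, and reward definitions are not given in this record.

```python
import random


def clip(x, lo, hi):
    """Clamp x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def td3_target(reward, done, next_action, q1_next, q2_next,
               gamma=0.99, noise_std=0.2, noise_clip=0.5,
               act_low=-1.0, act_high=1.0, rng=random):
    """Generic TD3 critic target for one transition (hypothetical helper).

    next_action is the target actor's output for the next state;
    q1_next and q2_next are callables standing in for the two target
    critics evaluated at the next state.
    """
    # Target policy smoothing: add clipped Gaussian noise to the target action.
    noise = clip(rng.gauss(0.0, noise_std), -noise_clip, noise_clip)
    smoothed = clip(next_action + noise, act_low, act_high)
    # Clipped double-Q: use the smaller of the two target-critic estimates.
    q_min = min(q1_next(smoothed), q2_next(smoothed))
    # No bootstrapping past a terminal transition.
    return reward + gamma * (0.0 if done else 1.0) * q_min


if __name__ == "__main__":
    # Toy linear critics, assumed purely for demonstration.
    y = td3_target(1.0, False, 0.5, lambda a: 2.0 * a, lambda a: 3.0 * a)
    print(y)
```

In a multiagent setting each agent would typically hold its own actor and critics; how the agents share observations in the proposed method is not stated in the abstract, so the sketch stays single-agent.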

Original language	English
Pages (from-to)	2985-3000
Number of pages	16
Journal	IEEE Transactions on Power Electronics
Volume	38
Issue number	3
DOIs	https://doi.org/10.1109/TPEL.2022.3218900
Publication status	Published - Mar 1 2023
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 1986-2012 IEEE.

ASJC Scopus Subject Areas

Electrical and Electronic Engineering

Keywords

input voltage sharing (IVS)
Input-series output-parallel-connected dual active bridge (ISOP-DAB) converter
multiagent twin-delayed deep deterministic policy gradient (MA-TD3)
triple phase shift modulation

Access to Document

10.1109/TPEL.2022.3218900

Cite this

@article{729e7eeb5f1b40c78e6c54801eeb1e46,
title = "Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid: A Multiagent Deep Reinforcement Learning-Based Method",
abstract = "This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.",
keywords = "input voltage sharing (IVS), Input-series output-parallel-connected dual active bridge (ISOP-DAB) converter, multiagent twin-delayed deep deterministic policy gradient (MA-TD3), triple phase shift modulation",
author = "Yu Zeng and Josep Pou and Changjiang Sun and Suvajit Mukherjee and Xu Xu and Amit Kumar Gupta and Jiaxin Dong",
note = "Publisher Copyright: {\textcopyright} 1986-2012 IEEE.",
year = "2023",
month = mar,
day = "1",
doi = "10.1109/TPEL.2022.3218900",
language = "English",
volume = "38",
pages = "2985--3000",
journal = "IEEE Transactions on Power Electronics",
issn = "0885-8993",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "3",
}

Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid: A Multiagent Deep Reinforcement Learning-Based Method. / Zeng, Yu; Pou, Josep; Sun, Changjiang et al.
In: IEEE Transactions on Power Electronics, Vol. 38, No. 3, 01.03.2023, p. 2985-3000.

TY  - JOUR
T1  - Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for ISOP-DAB Converter in DC Microgrid
T2  - A Multiagent Deep Reinforcement Learning-Based Method
AU  - Zeng, Yu
AU  - Pou, Josep
AU  - Sun, Changjiang
AU  - Mukherjee, Suvajit
AU  - Xu, Xu
AU  - Gupta, Amit Kumar
AU  - Dong, Jiaxin
PY  - 2023/3/1
Y1  - 2023/3/1
N2  - This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.
AB  - This article proposes a multiagent (MA) deep reinforcement learning (DRL) based autonomous input voltage sharing (IVS) control and triple phase shift modulation method for input-series output-parallel (ISOP) dual active bridge (DAB) converters to solve the three challenges: the uncertainties of the dc microgrid, the power balance problem, and the current stress minimization of the converter. Specifically, the control and modulation problem of the ISOP-DAB converter is formed as a Markov game with several DRL agents. Subsequently, the MA twin-delayed deep deterministic policy gradient (MA-TD3) algorithm is applied to train the DRL agents in an offline manner. After the training process, the multiple agents can provide online control decisions for the ISOP-DAB converter to balance the IVS, and minimize the current stress among different submodules. Without accurate model information, the proposed method can adaptively obtain the optimal modulation variable combinations in a stochastic and uncertain environment. Simulation and experimental results verify the effectiveness of the proposed MA-TD3-based algorithm.
KW  - input voltage sharing (IVS)
KW  - Input-series output-parallel-connected dual active bridge (ISOP-DAB) converter
KW  - multiagent twin-delayed deep deterministic policy gradient (MA-TD3)
KW  - triple phase shift modulation
UR  - http://www.scopus.com/inward/record.url?scp=85141581566&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85141581566&partnerID=8YFLogxK
U2  - 10.1109/TPEL.2022.3218900
DO  - 10.1109/TPEL.2022.3218900
M3  - Article
AN  - SCOPUS:85141581566
SN  - 0885-8993
VL  - 38
SP  - 2985
EP  - 3000
JO  - IEEE Transactions on Power Electronics
JF  - IEEE Transactions on Power Electronics
IS  - 3
ER  - 

Press/Media

Reports Summarize Technology Study Results from Nanyang Technological University (Autonomous Input Voltage Sharing Control and Triple Phase Shift Modulation Method for Isop-dab Converter In Dc Microgrid: a Multiagent Deep Reinforcement ...)