Learning to Schedule Joint Radar-Communication With Deep Multi-Agent Reinforcement Learning

Joash Lee; Dusit Niyato; Yong Liang Guan; Dong In Kim

doi:10.1109/TVT.2021.3124810

Learning to Schedule Joint Radar-Communication With Deep Multi-Agent Reinforcement Learning

Joash Lee, Dusit Niyato, Yong Liang Guan, Dong In Kim^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

27 Citations (Scopus)

Abstract

Radar detection and communication are two essential sub-tasks for the operation of next-generation autonomous vehicles (AVs). The forthcoming proliferation of faster 5G networks utilizing mmWave has raised concerns on interference with automotive radar sensors, which has led to a body of research on Joint Radar-Communication (JRC). This paper considers the problem of time-sharing for JRC, with the additional simultaneous objective of minimizing the average age of information (AoI) transmitted by a JRC-equipped AV. We first formulate the problem as a Markov Decision Process (MDP). We then propose a more general multi-agent system, with an appropriate medium access control (MAC) protocol, which is formulated as a partially observed Markov game (POMG). To solve the POMG, we propose a multi-agent extension of the Proximal Policy Optimization (PPO) algorithm, along with algorithmic features to enhance learning from raw observations. Simulations are run with a range of environmental parameters to mimic variations in real-world operation. The results show that the chosen deep reinforcement learning methods allow the agents to obtain strong performance with minimal a priori knowledge about the environment.

Original language	English
Pages (from-to)	406-422
Number of pages	17
Journal	IEEE Transactions on Vehicular Technology
Volume	71
Issue number	1
DOIs	https://doi.org/10.1109/TVT.2021.3124810
Publication status	Published - Jan 1 2022
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 1967-2012 IEEE.

ASJC Scopus Subject Areas

Automotive Engineering
Aerospace Engineering
Computer Networks and Communications
Electrical and Electronic Engineering

Keywords

communication
deep learning
Reinforcement learning
task scheduling
vehicle safety

Access to Document

10.1109/TVT.2021.3124810

Cite this

@article{ba9306cf2d0748d2bebe80dab84a220c,

title = "Learning to Schedule Joint Radar-Communication With Deep Multi-Agent Reinforcement Learning",

abstract = "Radar detection and communication are two essential sub-tasks for the operation of next-generation autonomous vehicles (AVs). The forthcoming proliferation of faster 5G networks utilizing mmWave has raised concerns on interference with automotive radar sensors, which has led to a body of research on Joint Radar-Communication (JRC). This paper considers the problem of time-sharing for JRC, with the additional simultaneous objective of minimizing the average age of information (AoI) transmitted by a JRC-equipped AV. We first formulate the problem as a Markov Decision Process (MDP). We then propose a more general multi-agent system, with an appropriate medium access control (MAC) protocol, which is formulated as a partially observed Markov game (POMG). To solve the POMG, we propose a multi-agent extension of the Proximal Policy Optimization (PPO) algorithm, along with algorithmic features to enhance learning from raw observations. Simulations are run with a range of environmental parameters to mimic variations in real-world operation. The results show that the chosen deep reinforcement learning methods allow the agents to obtain strong performance with minimal a priori knowledge about the environment.",

keywords = "communication, deep learning, Reinforcement learning, task scheduling, vehicle safety",

author = "Joash Lee and Dusit Niyato and Guan, {Yong Liang} and Kim, {Dong In}",

note = "Publisher Copyright: {\textcopyright} 1967-2012 IEEE.",

year = "2022",

month = jan,

day = "1",

doi = "10.1109/TVT.2021.3124810",

language = "English",

volume = "71",

pages = "406--422",

journal = "IEEE Transactions on Vehicular Technology",

issn = "0018-9545",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "1",

}

TY - JOUR

T1 - Learning to Schedule Joint Radar-Communication With Deep Multi-Agent Reinforcement Learning

AU - Lee, Joash

AU - Niyato, Dusit

AU - Guan, Yong Liang

AU - Kim, Dong In

PY - 2022/1/1

Y1 - 2022/1/1

N2 - Radar detection and communication are two essential sub-tasks for the operation of next-generation autonomous vehicles (AVs). The forthcoming proliferation of faster 5G networks utilizing mmWave has raised concerns on interference with automotive radar sensors, which has led to a body of research on Joint Radar-Communication (JRC). This paper considers the problem of time-sharing for JRC, with the additional simultaneous objective of minimizing the average age of information (AoI) transmitted by a JRC-equipped AV. We first formulate the problem as a Markov Decision Process (MDP). We then propose a more general multi-agent system, with an appropriate medium access control (MAC) protocol, which is formulated as a partially observed Markov game (POMG). To solve the POMG, we propose a multi-agent extension of the Proximal Policy Optimization (PPO) algorithm, along with algorithmic features to enhance learning from raw observations. Simulations are run with a range of environmental parameters to mimic variations in real-world operation. The results show that the chosen deep reinforcement learning methods allow the agents to obtain strong performance with minimal a priori knowledge about the environment.

AB - Radar detection and communication are two essential sub-tasks for the operation of next-generation autonomous vehicles (AVs). The forthcoming proliferation of faster 5G networks utilizing mmWave has raised concerns on interference with automotive radar sensors, which has led to a body of research on Joint Radar-Communication (JRC). This paper considers the problem of time-sharing for JRC, with the additional simultaneous objective of minimizing the average age of information (AoI) transmitted by a JRC-equipped AV. We first formulate the problem as a Markov Decision Process (MDP). We then propose a more general multi-agent system, with an appropriate medium access control (MAC) protocol, which is formulated as a partially observed Markov game (POMG). To solve the POMG, we propose a multi-agent extension of the Proximal Policy Optimization (PPO) algorithm, along with algorithmic features to enhance learning from raw observations. Simulations are run with a range of environmental parameters to mimic variations in real-world operation. The results show that the chosen deep reinforcement learning methods allow the agents to obtain strong performance with minimal a priori knowledge about the environment.

KW - communication

KW - deep learning

KW - Reinforcement learning

KW - task scheduling

KW - vehicle safety

UR - http://www.scopus.com/inward/record.url?scp=85118636462&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85118636462&partnerID=8YFLogxK

U2 - 10.1109/TVT.2021.3124810

DO - 10.1109/TVT.2021.3124810

M3 - Article

AN - SCOPUS:85118636462

SN - 0018-9545

VL - 71

SP - 406

EP - 422

JO - IEEE Transactions on Vehicular Technology

JF - IEEE Transactions on Vehicular Technology

IS - 1

ER -

Learning to Schedule Joint Radar-Communication With Deep Multi-Agent Reinforcement Learning

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this