Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

Mohit Prashant; Arvind Easwaran; Suman Das; Michael Yuhas

doi:10.1609/aaai.v39i12.33357

Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

Mohit Prashant, Arvind Easwaran, Suman Das, Michael Yuhas

Nanyang Technological University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

An issue concerning the use of deep reinforcement learning (RL) agents is whether they can be trusted to perform reliably when deployed, as training environments may not reflect real-life environments. Anticipating instances outside their training scope, learning-enabled systems are often equipped with out-of-distribution (OOD) detectors that alert when a trained system encounters a state it does not recognize or in which it exhibits uncertainty. There exists limited work conducted on the problem of OOD detection within RL, with prior studies being unable to achieve a consensus on the definition of OOD execution within the context of RL. By framing our problem using a Markov Decision Process, we assume there is a transition distribution mapping each state-action pair to another state with some probability. Based on this, we consider the following definition of OOD execution within RL: A transition is OOD if its probability during real-life deployment differs from the transition distribution encountered during training. As such, we utilize conditional variational autoencoders (CVAE) to approximate the transition dynamics of the training environment and implement a conformity-based detector using reconstruction loss that is able to guarantee OOD detection with a pre-determined confidence level. We evaluate our detector by adapting existing benchmarks and compare it with existing OOD detection models for RL.

Original language	English
Title of host publication	Special Track on AI Alignment
Editors	Toby Walsh, Julie Shah, Zico Kolter
Publisher	Association for the Advancement of Artificial Intelligence
Pages	12452-12460
Number of pages	9
Edition	12
ISBN (Electronic)	157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 157735897X, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978, 9781577358978
DOIs	https://doi.org/10.1609/aaai.v39i12.33357
Publication status	Published - Apr 11 2025
Externally published	Yes
Event	39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 - Philadelphia, United States Duration: Feb 25 2025 → Mar 4 2025

Publication series

Name	Proceedings of the AAAI Conference on Artificial Intelligence
Number	12
Volume	39
ISSN (Print)	2159-5399
ISSN (Electronic)	2374-3468

Conference

Conference	39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
Country/Territory	United States
City	Philadelphia
Period	2/25/25 → 3/4/25

Bibliographical note

Publisher Copyright:
Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

ASJC Scopus Subject Areas

Artificial Intelligence

Access to Document

10.1609/aaai.v39i12.33357

Cite this

Prashant, M., Easwaran, A., Das, S., & Yuhas, M. (2025). Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation. In T. Walsh, J. Shah, & Z. Kolter (Eds.), Special Track on AI Alignment (12 ed., pp. 12452-12460). (Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 39, No. 12). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v39i12.33357

Prashant, Mohit ; Easwaran, Arvind ; Das, Suman et al. / Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation. Special Track on AI Alignment. editor / Toby Walsh ; Julie Shah ; Zico Kolter. 12. ed. Association for the Advancement of Artificial Intelligence, 2025. pp. 12452-12460 (Proceedings of the AAAI Conference on Artificial Intelligence; 12).

@inproceedings{8f5e83ffb973442db4c2bc5996beba27,

title = "Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation",

abstract = "An issue concerning the use of deep reinforcement learning (RL) agents is whether they can be trusted to perform reliably when deployed, as training environments may not reflect real-life environments. Anticipating instances outside their training scope, learning-enabled systems are often equipped with out-of-distribution (OOD) detectors that alert when a trained system encounters a state it does not recognize or in which it exhibits uncertainty. There exists limited work conducted on the problem of OOD detection within RL, with prior studies being unable to achieve a consensus on the definition of OOD execution within the context of RL. By framing our problem using a Markov Decision Process, we assume there is a transition distribution mapping each state-action pair to another state with some probability. Based on this, we consider the following definition of OOD execution within RL: A transition is OOD if its probability during real-life deployment differs from the transition distribution encountered during training. As such, we utilize conditional variational autoencoders (CVAE) to approximate the transition dynamics of the training environment and implement a conformity-based detector using reconstruction loss that is able to guarantee OOD detection with a pre-determined confidence level. We evaluate our detector by adapting existing benchmarks and compare it with existing OOD detection models for RL.",

author = "Mohit Prashant and Arvind Easwaran and Suman Das and Michael Yuhas",

note = "Publisher Copyright: Copyright {\textcopyright} 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 ; Conference date: 25-02-2025 Through 04-03-2025",

year = "2025",

month = apr,

day = "11",

doi = "10.1609/aaai.v39i12.33357",

language = "English",

series = "Proceedings of the AAAI Conference on Artificial Intelligence",

publisher = "Association for the Advancement of Artificial Intelligence",

number = "12",

pages = "12452--12460",

editor = "Toby Walsh and Julie Shah and Zico Kolter",

booktitle = "Special Track on AI Alignment",

edition = "12",

}

Prashant, M, Easwaran, A, Das, S & Yuhas, M 2025, Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation. in T Walsh, J Shah & Z Kolter (eds), Special Track on AI Alignment. 12 edn, Proceedings of the AAAI Conference on Artificial Intelligence, no. 12, vol. 39, Association for the Advancement of Artificial Intelligence, pp. 12452-12460, 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025, Philadelphia, United States, 2/25/25. https://doi.org/10.1609/aaai.v39i12.33357

Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation. / Prashant, Mohit; Easwaran, Arvind; Das, Suman et al.
Special Track on AI Alignment. ed. / Toby Walsh; Julie Shah; Zico Kolter. 12. ed. Association for the Advancement of Artificial Intelligence, 2025. p. 12452-12460 (Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 39, No. 12).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

AU - Prashant, Mohit

AU - Easwaran, Arvind

AU - Das, Suman

AU - Yuhas, Michael

PY - 2025/4/11

Y1 - 2025/4/11

N2 - An issue concerning the use of deep reinforcement learning (RL) agents is whether they can be trusted to perform reliably when deployed, as training environments may not reflect real-life environments. Anticipating instances outside their training scope, learning-enabled systems are often equipped with out-of-distribution (OOD) detectors that alert when a trained system encounters a state it does not recognize or in which it exhibits uncertainty. There exists limited work conducted on the problem of OOD detection within RL, with prior studies being unable to achieve a consensus on the definition of OOD execution within the context of RL. By framing our problem using a Markov Decision Process, we assume there is a transition distribution mapping each state-action pair to another state with some probability. Based on this, we consider the following definition of OOD execution within RL: A transition is OOD if its probability during real-life deployment differs from the transition distribution encountered during training. As such, we utilize conditional variational autoencoders (CVAE) to approximate the transition dynamics of the training environment and implement a conformity-based detector using reconstruction loss that is able to guarantee OOD detection with a pre-determined confidence level. We evaluate our detector by adapting existing benchmarks and compare it with existing OOD detection models for RL.

AB - An issue concerning the use of deep reinforcement learning (RL) agents is whether they can be trusted to perform reliably when deployed, as training environments may not reflect real-life environments. Anticipating instances outside their training scope, learning-enabled systems are often equipped with out-of-distribution (OOD) detectors that alert when a trained system encounters a state it does not recognize or in which it exhibits uncertainty. There exists limited work conducted on the problem of OOD detection within RL, with prior studies being unable to achieve a consensus on the definition of OOD execution within the context of RL. By framing our problem using a Markov Decision Process, we assume there is a transition distribution mapping each state-action pair to another state with some probability. Based on this, we consider the following definition of OOD execution within RL: A transition is OOD if its probability during real-life deployment differs from the transition distribution encountered during training. As such, we utilize conditional variational autoencoders (CVAE) to approximate the transition dynamics of the training environment and implement a conformity-based detector using reconstruction loss that is able to guarantee OOD detection with a pre-determined confidence level. We evaluate our detector by adapting existing benchmarks and compare it with existing OOD detection models for RL.

UR - http://www.scopus.com/inward/record.url?scp=105003907479&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=105003907479&partnerID=8YFLogxK

U2 - 10.1609/aaai.v39i12.33357

DO - 10.1609/aaai.v39i12.33357

M3 - Conference contribution

AN - SCOPUS:105003907479

T3 - Proceedings of the AAAI Conference on Artificial Intelligence

SP - 12452

EP - 12460

BT - Special Track on AI Alignment

A2 - Walsh, Toby

A2 - Shah, Julie

A2 - Kolter, Zico

PB - Association for the Advancement of Artificial Intelligence

T2 - 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

Y2 - 25 February 2025 through 4 March 2025

ER -

Prashant M, Easwaran A, Das S, Yuhas M. Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation. In Walsh T, Shah J, Kolter Z, editors, Special Track on AI Alignment. 12 ed. Association for the Advancement of Artificial Intelligence. 2025. p. 12452-12460. (Proceedings of the AAAI Conference on Artificial Intelligence; 12). doi: 10.1609/aaai.v39i12.33357

Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

Abstract

Publication series

Conference

Bibliographical note

ASJC Scopus Subject Areas

Access to Document

Other files and links

Fingerprint

Cite this