We consider the popular Partially Observable Monte-Carlo Plan- ning (POMCP) algorithm and propose a methodology, called Active XPOMCP, for generating compact logical rules that represent prop- erties of the control policy. These rules are then used as shields to prevent POMCP from selecting unexpected actions, with useful implications on the security and trustworthiness of the algorithm. Contrary to state-of-the-art methods, Active XPOMCP does not require a previously generated set of belief-action pairs to generate the logical rule, but it actively generates this data in an information- efficient way by querying the algorithm. Active XPOMCP reduces the number of beliefs needed to generate accurate rules with re- spect to state-of-the-art methods, and it allows to produce more accurate shields when few belief-action samples are available.

Active Generation of Logical Rules for POMCP Shielding

Mazzi, G.
Conceptualization
;
Castellini, A.
Conceptualization
;
Farinelli, A.
Supervision
2022-01-01

Abstract

We consider the popular Partially Observable Monte-Carlo Plan- ning (POMCP) algorithm and propose a methodology, called Active XPOMCP, for generating compact logical rules that represent prop- erties of the control policy. These rules are then used as shields to prevent POMCP from selecting unexpected actions, with useful implications on the security and trustworthiness of the algorithm. Contrary to state-of-the-art methods, Active XPOMCP does not require a previously generated set of belief-action pairs to generate the logical rule, but it actively generates this data in an information- efficient way by querying the algorithm. Active XPOMCP reduces the number of beliefs needed to generate accurate rules with re- spect to state-of-the-art methods, and it allows to produce more accurate shields when few belief-action samples are available.
2022
POMDP, POMCP, Shielding, SMT
File in questo prodotto:
File Dimensione Formato  
p1696.pdf

accesso aperto

Descrizione: Paper
Tipologia: Versione dell'editore
Licenza: Dominio pubblico
Dimensione 981.83 kB
Formato Adobe PDF
981.83 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1122348
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact