Active Generation of Logical Rules for POMCP Shielding
Mazzi, G. (Conceptualization); Castellini, A. (Conceptualization); Farinelli, A. (Supervision)
2022-01-01
Abstract
We consider the popular Partially Observable Monte-Carlo Planning (POMCP) algorithm and propose a methodology, called Active XPOMCP, for generating compact logical rules that represent properties of the control policy. These rules are then used as shields to prevent POMCP from selecting unexpected actions, with useful implications for the security and trustworthiness of the algorithm. Contrary to state-of-the-art methods, Active XPOMCP does not require a previously generated set of belief-action pairs to generate the logical rules; instead, it actively generates this data in an information-efficient way by querying the algorithm. Active XPOMCP reduces the number of beliefs needed to generate accurate rules with respect to state-of-the-art methods, and it produces more accurate shields when few belief-action samples are available.

File | Size | Format |
---|---|---|
p1696.pdf (open access; Description: Paper; Type: Publisher's version; License: Public domain) | 981.83 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.