Isolation Forests represent a recent variant of Random Forests, specifically designed for one-class classification problems. In the original version, this method builds a set of extremely randomized trees to describe the set of points, subsequently measuring the “anomaly” of a testing point by looking at how much deep it arrives in each tree. Even if few extensions have been recently proposed – mainly aimed at improving the training stage – in most cases the anomaly score is still kept in its original formulation, which does not completely exploit all the information contained in the trained forest. This paper is focused on improving this aspect, and proposes a new approach for the computation of the anomaly score, which exploits the different information linked to the different nodes of the trees of the forest. We investigate three dif- ferent variants of the novel anomaly score, evaluating them with twelve UCI benchmark datasets, with encouraging results.

A Novel Anomaly Score for Isolation Forests

Mensi, Antonella;Bicego, Manuele
2019-01-01

Abstract

Isolation Forests represent a recent variant of Random Forests, specifically designed for one-class classification problems. In the original version, this method builds a set of extremely randomized trees to describe the set of points, subsequently measuring the “anomaly” of a testing point by looking at how much deep it arrives in each tree. Even if few extensions have been recently proposed – mainly aimed at improving the training stage – in most cases the anomaly score is still kept in its original formulation, which does not completely exploit all the information contained in the trained forest. This paper is focused on improving this aspect, and proposes a new approach for the computation of the anomaly score, which exploits the different information linked to the different nodes of the trees of the forest. We investigate three dif- ferent variants of the novel anomaly score, evaluating them with twelve UCI benchmark datasets, with encouraging results.
2019
978-3-030-30641-0
pattern recognition, random forests
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1017203
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 20
  • ???jsp.display-item.citation.isi??? 18
social impact