CATALOGO DEI PRODOTTI DELLA RICERCA

In state-of-the-art deep single-label classification models, the top-k (k = 2,3,4,....) accuracy is usually significantly higher than the top-1 accuracy. This is more evident in fine-grained datasets, where differences between classes are quite subtle. Exploiting the information provided in the top k predicted classes boosts the final prediction of a model. We propose Guided Zoom, a novel way in which explainabitity could be used to improve model performance. We do so by making sure the model has "the right reasons" fora prediction. The reason/evidence upon which a deep neural network makes a prediction is defined to be the grounding, in the pixel space, for a specific class conditional probability in the model output. Guided Zoom examines how reasonable the evidence used to make each of the top-k predictions is. Test time evidence is deemed reasonable if it is coherent with evidence used to make similar correct decisions at training time. This leads to better informed predictions. We explore a variety of grounding techniques and study their complementarity for computing evidence. We show that Guided Zoom results in an improvement of a model's classification accuracy and achieves state-of-the-art classification performance on four fine-grained classification datasets.

Guided Zoom: Zooming into Network Evidence to Refine Fine-Grained Model Decisions

Bargal, SA;Zunino, A;Petsiuk, V;Zhang, JM;Saenko, K;Murino, V;Sclaroff, S

2021-01-01

Abstract

In state-of-the-art deep single-label classification models, the top-k (k = 2,3,4,....) accuracy is usually significantly higher than the top-1 accuracy. This is more evident in fine-grained datasets, where differences between classes are quite subtle. Exploiting the information provided in the top k predicted classes boosts the final prediction of a model. We propose Guided Zoom, a novel way in which explainabitity could be used to improve model performance. We do so by making sure the model has "the right reasons" fora prediction. The reason/evidence upon which a deep neural network makes a prediction is defined to be the grounding, in the pixel space, for a specific class conditional probability in the model output. Guided Zoom examines how reasonable the evidence used to make each of the top-k predictions is. Test time evidence is deemed reasonable if it is coherent with evidence used to make similar correct decisions at training time. This leads to better informed predictions. We explore a variety of grounding techniques and study their complementarity for computing evidence. We show that Guided Zoom results in an improvement of a model's classification accuracy and achieves state-of-the-art classification performance on four fine-grained classification datasets.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Parole chiave
	
				Explainable AI
grounding
saliency
fine-grained image classification
classification refinement
convolutional neural networks
			
	Appare nelle tipologie:
	
				01.01 Articolo in Rivista

File in questo prodotto:

File	Dimensione	Formato
paper_guided-zoom.pdf solo utenti autorizzati Tipologia: Documento in Post-print Licenza: Non specificato Dimensione 8.75 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	8.75 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1060737

Citazioni

ND

26

22

social impact