In this technical report, we describe the Guided-Attention mechanism [1] based solution for the short-term anticipation(STA) challenge for the EGO4D challenge. It combines theobject detections, and the spatiotemporal features extractedfrom video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocen-tric videos. For the challenge, we build our model on top ofStillFast [2] with Guided Attention applied on fast network.Our model obtains better performance on the validation setand also achieves state-of-the-art (SOTA) results on the challenge test set for EGO4D Short-Term Object Interaction Anticipation Challenge.

Guided Attention for Next Active Object @ Ego4D Short Term Object Interaction Anticipation Challenge

C. Beyan;V. Murino;A. Del Bue
2023-01-01

Abstract

In this technical report, we describe the Guided-Attention mechanism [1] based solution for the short-term anticipation(STA) challenge for the EGO4D challenge. It combines theobject detections, and the spatiotemporal features extractedfrom video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocen-tric videos. For the challenge, we build our model on top ofStillFast [2] with Guided Attention applied on fast network.Our model obtains better performance on the validation setand also achieves state-of-the-art (SOTA) results on the challenge test set for EGO4D Short-Term Object Interaction Anticipation Challenge.
2023
Object interaction; Anticipation; Egocenric vision
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1122754
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact