Purpose: Automation in robot-assisted surgery (RAS) requires not only accurate scene understanding but also real-time reasoning and action within dynamic surgical workflows. This work introduces SPARTAN: the Surgical Peg-And-Ring Triplet and Workflow ANticipation Benchmark, alongside a unified baseline for real-time surgical workflow analysis, for the first time jointly addressing surgical phase recognition, phase anticipation, and action triplet recognition. This integrated design bridges high-level workflow understanding with fine-grained, robot-action-level perception. Methods: The SPARTAN benchmark is based on a modified Peg-and-Ring training task performed on the da Vinci Research Kit (dVRK), providing frame-level annotations of surgical phases and dual-arm action triplets that delineate initial, intermediate, and final workflow states. Results: We demonstrate that our baseline achieves performance comparable to state-of-the-art methods across all three SPARTAN tasks while operating in real time. The benchmark offers complexity comparable to related datasets in terms of phase structure, number of videos, and triplet diversity, yet remains reproducible and directly applicable to physical robotic systems. Conclusion: SPARTAN provides a practical foundation for developing and evaluating real-time perception and reasoning models in RAS.

Spartan: surgical peg-and-ring triplet and workflow anticipation benchmark

Cunico, Federico;Sandrini, Michele;Piccinelli, Nicola;Muradore, Riccardo
2026-01-01

Abstract

Purpose: Automation in robot-assisted surgery (RAS) requires not only accurate scene understanding but also real-time reasoning and action within dynamic surgical workflows. This work introduces SPARTAN: the Surgical Peg-And-Ring Triplet and Workflow ANticipation Benchmark, alongside a unified baseline for real-time surgical workflow analysis, for the first time jointly addressing surgical phase recognition, phase anticipation, and action triplet recognition. This integrated design bridges high-level workflow understanding with fine-grained, robot-action-level perception. Methods: The SPARTAN benchmark is based on a modified Peg-and-Ring training task performed on the da Vinci Research Kit (dVRK), providing frame-level annotations of surgical phases and dual-arm action triplets that delineate initial, intermediate, and final workflow states. Results: We demonstrate that our baseline achieves performance comparable to state-of-the-art methods across all three SPARTAN tasks while operating in real time. The benchmark offers complexity comparable to related datasets in terms of phase structure, number of videos, and triplet diversity, yet remains reproducible and directly applicable to physical robotic systems. Conclusion: SPARTAN provides a practical foundation for developing and evaluating real-time perception and reasoning models in RAS.
2026
Surgical workflow anticipation, Surgical action triplet recognition, Surgical workflow analysis
File in questo prodotto:
File Dimensione Formato  
Cunico_et_al-2026-International_Journal_of_Computer_Assisted_Radiology_and_Surgery.pdf

accesso aperto

Licenza: Dominio pubblico
Dimensione 568.26 kB
Formato Adobe PDF
568.26 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1191447
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact