PitVis-2023 challenge: Workflow recognition in videos of endoscopic pituitary surgery.

Das, Adrito; He, Junjun; Jund, Antoine; Speidel, Stefanie; Stoyanov, Danail; Vasconcelos, Francisco; Pérez, Alejandra; Khan, Danyal Z; Wu, Jinlin; Hanrahan, John G; Kasai, Satoshi; Zheng, Guoyan; Pang, You; Ye, Jin; Yamlahi, Amine; Kondo, Satoshi; Mazher, Moona; Rivoir, Dominik; Chen, Zhen; Zhang, Yitong; Płotka, Szymon; Arbeláez, Pablo; Godau, Patrick; Qayyum, Abdul; Kaleta, Joanna; Razzak, Imran; Zou, Xiaoyang; Hirasawa, Kousuke; Rodriguez, Santiago; Psychogyios, Dimitrios; Marcus, Hani J; Li, Tianbin; Bano, Sophia

doi:10.1016/j.media.2025.103716

Journal Article

DKFZ-2025-01640

Das, A. ; Khan, D. Z. ; Psychogyios, D. ; Zhang, Y. ; Hanrahan, J. G. ; Vasconcelos, F. ; Pang, Y. ; Chen, Z. ; Wu, J. ; Zou, X. ; Zheng, G. ; Qayyum, A. ; Mazher, M. ; Razzak, I. ; Li, T. ; Ye, J. ; He, J. ; Płotka, S. ; Kaleta, J. ; Yamlahi, A. (DKFZ) ; Jund, A. (DKFZ) ; Godau, P. (DKFZ) ; Kondo, S. ; Kasai, S. ; Hirasawa, K. ; Rivoir, D. ; Speidel, S. ; Pérez, A. ; Rodriguez, S. ; Arbeláez, P. ; Stoyanov, D. ; Marcus, H. J. ; Bano, S.

2025
Elsevier Science Amsterdam [u.a.]

Medical image analysis 106, 103716 (2025) [10.1016/j.media.2025.103716]

Please use a persistent id in citations: doi:10.1016/j.media.2025.103716

Abstract: The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery, including which surgical steps are performed and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery or during live surgery. The Pituitary Vision (PitVis) 2023 Challenge tasks the community with step and instrument recognition in videos of endoscopic pituitary surgery. This is a particularly challenging task when compared to other minimally invasive surgeries due to the smaller working space, which limits and distorts vision, and the higher frequency of instrument and step switching, which requires more precise model predictions. Participants were provided with 25 videos, with results presented at the MICCAI-2023 conference as part of the Endoscopic Vision 2023 Challenge in Vancouver, Canada, on 08-Oct-2023. There were 18 submissions from 9 teams across 6 countries, using a variety of deep learning models. The top-performing model for step recognition utilised a transformer-based architecture, uniquely using an autoregressive decoder with a positional encoding input. The top-performing model for instrument recognition utilised a spatial encoder followed by a temporal encoder, which uniquely used a 2-layer temporal architecture. In both cases, these models outperformed purely spatial models, illustrating the importance of sequential and temporal information. PitVis-2023 therefore demonstrates that state-of-the-art computer vision models in minimally invasive surgery are transferable to a new dataset. Benchmark results are provided in the paper, and the dataset is publicly available at: https://doi.org/10.5522/04/26531686.

Keyword(s): Endoscopic vision ; Instrument recognition ; Step recognition ; Surgical AI ; Surgical vision ; Workflow analysis

Classification:

ddc:610

Contributing Institute(s):

E130 Intelligente Medizinische Systeme (E130)

Research Program(s):

315 - Bildgebung und Radioonkologie (POF4-315)

Appears in the scientific report 2025

Database coverage:
Medline ; Clarivate Analytics Master Journal List ; Current Contents - Engineering, Computing and Technology ; Ebsco Academic Search ; Essential Science Indicators ; IF >= 10 ; JCR ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection

The record appears in these collections:
Document types > Articles > Journal Article
Public records
Publications database

Record created 2025-08-07, last modified 2025-08-08
