The results demonstrate that the LoViT approach achieves state-of-the-art performance in surgical phase recognition on two datasets that differ in surgical procedure and temporal sequencing characteristics.
Authors
S. Ourselin
Tom Kamiel Magda Vercauteren
Luis C. García-Peraza-Herrera
Alejandro Granados
Yang Liu
Maxence Boels
P. Dasgupta