1
MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation
2
Adaptive t-vMF Dice Loss for Multi-class Medical Image Segmentation
3
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
4
MSDESIS: Multi-task stereo disparity estimation and surgical instrument segmentation
5
SSIS-Seg: Simulation-Supervised Image Synthesis for Surgical Instrument Segmentation
6
PATG: position-aware temporal graph networks for surgical phase recognition on laparoscopic videos
7
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
8
TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
9
Gesture Recognition in Robotic Surgery with Multimodal Attention
10
Transformers in Medical Imaging: A Survey
11
Masked-attention Mask Transformer for Universal Image Segmentation
12
ASFormer: Transformer for Action Segmentation
13
ActionCLIP: A New Paradigm for Video Action Recognition
14
Towards accurate and interpretable surgical skill assessment: a video-based method for skill score prediction and guiding feedback generation
15
Ranger21: a synergistic deep learning optimizer
16
Multiscale Vision Transformers
17
EfficientNetV2: Smaller Models and Faster Training
18
Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer
19
Co-Generation and Segmentation for Generalized Surgical Instrument Segmentation on Unlabelled Data
20
Simulation-to-real domain adaptation with teacher–student learning for endoscopic instrument segmentation
21
Multi-Class Detection of Laparoscopic Instruments for the Intelligent Box-Trainer System Using Faster R-CNN Architecture
22
Relational Graph Learning on Visual and Kinematics Embeddings for Accurate Gesture Recognition in Robotic Surgery
23
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
24
Computer Vision in the Surgical Operating Room
25
daVinciNet: Joint Prediction of Motion and Surgical State in Robot-Assisted Surgery
26
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery
27
A comparative analysis of multi-backbone Mask R-CNN for surgical tools detection
28
Surgical tool segmentation and localization using spatio-temporal deep network
29
2018 Robotic Scene Segmentation Challenge
30
Object-Contextual Representations for Semantic Segmentation
31
A DVRK-based Framework for Surgical Subtask Automation
32
On the Variance of the Adaptive Learning Rate and Beyond
33
Incorporating Temporal Prior from Motion Flow for Instrument Segmentation in Minimally Invasive Surgery Video
34
Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis
35
Learning Where to Look While Tracking Instruments in Robot-assisted Surgery
36
High-Resolution Representations for Labeling Pixels and Regions
37
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
38
2017 Robotic Instrument Segmentation Challenge
39
UPSNet: A Unified Panoptic Segmentation Network
40
SlowFast Networks for Video Recognition
41
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
42
Why rankings of biomedical image analysis competitions should be interpreted with care
43
SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network
44
Automatic Instrument Segmentation in Robot-Assisted Surgery Using Deep Learning
45
Knowledge-based support for surgical workflow analysis and recognition
46
Decoupled Weight Decay Regularization
47
Generalised Dice overlap as a deep learning loss function for highly unbalanced segmentations
48
ToolNet: Holistically-nested real-time segmentation of robotic surgical tools
49
LinkNet: Exploiting encoder representations for efficient semantic segmentation
50
Attention is All you Need
51
The Kinetics Human Action Video Dataset
52
Feature Pyramid Networks for Object Detection
53
Xception: Deep Learning with Depthwise Separable Convolutions
54
Temporal Convolutional Networks: A Unified Approach to Action Segmentation
55
SGDR: Stochastic Gradient Descent with Warm Restarts
56
Deep Residual Learning for Image Recognition
57
U-Net: Convolutional Networks for Biomedical Image Segmentation
58
Adam: A Method for Stochastic Optimization
59
Microsoft COCO: Common Objects in Context
60
Combining embedded accelerometers with computer vision for recognizing food preparation activities
61
ImageNet: A large-scale hierarchical image database
64
$I$-Divergence Geometry of Probability Distributions and Minimization Problems
65
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
66
Seg-former: Simple and e ffi cient design for semantic segmentation with trans-formers
67
Robust deep learning-based semantic organ segmentation in hyperspectral images
68
Informer: Beyond e ffi cient transformer for long sequence time-series forecasting
69
E ffi cientnet: Rethinking model scaling for convolutional neural networks
70
JHU-ISI Gesture and Skill Assessment Working Set ( JIGSAWS ) : A Surgical Activity Dataset for Human Motion Modeling
71
The Chinese University of Hong Kong , Hong Kong , China
72
Reggio Emilia, Modena, Italy
74
Parts of instrumentation that are fully submerged in fluids are not annotated (Fig. 2d)
75
Claspers that have holes, (i.e. Cadiere forceps and Pro-Grasp forceps), are labeled as if they were not perforated (Fig. 2b)
76
e) Tool parts near the edge of frames that are not clearly visible due to illumination, are not annotated (Fig. 2e)
77
2022. Towards holistic surgical scene understanding
78
When fluids occlude surgical instrumentation by floating on top of or away from them, masks are defined to approximate the expected shape of the occluded object (Fig. 2c)