1
Forest Graph Convolutional Network for Surgical Action Triplet Recognition in Endoscopic Videos
2
Computer vision in surgery: from potential to clinical value
3
Why Deep Surgical Models Fail?: Revisiting Surgical Action Triplet Recognition through the Lens of Robustness
4
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy
5
Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet Datasets
6
CholecTriplet2021: A benchmark challenge for surgical action triplet recognition
7
SIRNet: Fine-Grained Surgical Interaction Recognition
8
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
9
Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer
10
Swin Transformer V2: Scaling Up Capacity and Resolution
11
2020 CATARACTS Semantic Segmentation Challenge
12
Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark
13
Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos
15
Shallow Feature Matters for Weakly Supervised Object Localization
16
Emerging Properties in Self-Supervised Vision Transformers
17
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods
18
Learning Domain Adaptation with Model Calibration for Surgical Report Generation in Robotic Surgery
19
Temporal Memory Relation Network for Workflow Recognition From Surgical Video
20
MIcro-Surgical Anastomose Workflow recognition challenge report
21
Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer
22
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
23
End-to-End Human Object Interaction Detection with HOI Transformer
24
OperA: Attention-Regularized Transformers for Surgical Phase Recognition
25
Endoscopic Vision Challenge 2021
26
Surgical Visual Domain Adaptation: Results from the MICCAI 2020 SurgVisDom Challenge
27
Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
28
Is Space-Time Attention All You Need for Video Understanding?
29
CholecSeg8k: A Semantic Segmentation Dataset for Laparoscopic Cholecystectomy Based on Cholec80
30
Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition
31
Comparative validation of multi-instance instrument segmentation in endoscopy: Results of the ROBUST-MIS 2019 challenge
32
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
33
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks
34
Proposing novel methods for gynecologic surgical action recognition on laparoscopic videos
35
Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets
36
Heidelberg colorectal data set for surgical data science in the sensor operating room
37
TeCNO: Surgical Phase Recognition with Multi-Stage Temporal Convolutional Networks
38
MOT20: A benchmark for multi object tracking in crowded scenes
39
Endoscopic Vision Challenge
40
Assisted phase and step annotation for surgical videos
41
2018 Robotic Scene Segmentation Challenge
42
CAI4CAI: The Rise of Contextual Artificial Intelligence in Computer-Assisted Interventions
43
Methods and open-source toolkit for analyzing and visualizing challenge results
44
BIAS: Transparent reporting of biomedical image analysis challenges
45
CaDIS: Cataract Dataset for Image Segmentation
46
2017 Robotic Instrument Segmentation Challenge
47
MOTS: Multi-Object Tracking and Segmentation
48
CATARACTS: Challenge on automatic tool annotation for cataRACT surgery
49
SlowFast Networks for Video Recognition
50
Weakly supervised convolutional LSTM approach for tool tracking in laparoscopic videos
51
Learning from a tiny dataset of manual annotations: a teacher/student approach for surgical phase recognition
52
Learning Human-Object Interactions by Graph Parsing Neural Networks
53
Toward a standard ontology of surgical process models
54
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks
55
Temporal coherence-based self-supervised learning for laparoscopic workflow analysis
56
Weakly-Supervised Learning for Tool Localization in Laparoscopic Videos
57
SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network
58
Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks
59
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
60
Attention is All you Need
61
AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions
62
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
63
Detecting and Recognizing Human-Object Interactions
64
Learning to Detect Human-Object Interactions
65
The TUM LapChole dataset for the M2CAI 2016 workflow challenge
66
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering
67
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
68
The Cityscapes Dataset for Semantic Urban Scene Understanding
69
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs
70
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
71
Deep Residual Learning for Image Recognition
72
HICO: A Benchmark for Recognizing Human-Object Interactions in Images
73
The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS)
74
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
75
LapOntoSPM: an ontology for laparoscopic surgeries and its application to surgical phase recognition
76
Visual Semantic Role Labeling
77
Automatic phase prediction from low-level surgical activities
78
A Novel Performance Evaluation Methodology for Single-Target Trackers
79
Learning Spatiotemporal Features with 3D Convolutional Networks
80
Long-term recurrent convolutional networks for visual recognition and description
81
ImageNet Large Scale Visual Recognition Challenge
82
Knowledge-Driven Formalization of Laparoscopic Surgeries for Rule-Based Intraoperative Context-Aware Assistance
83
Large-Scale Video Classification with Convolutional Neural Networks
84
Two-Stream Convolutional Networks for Action Recognition in Videos
85
Microsoft COCO: Common Objects in Context
86
Surgical process modelling: a review
87
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
88
Linking Top-Level Ontologies and Surgical Workflows
89
Structured recording of intraoperative surgical workflows
90
Towards automatic skill evaluation: Detection and segmentation of robot-assisted surgical motions
91
Deliberate Perioperative Systems Design Improves Operating Room Throughput
92
ENT-surgical workflow as an instrument to assess the efficiency of technological developments in medicine
93
The 2005 PASCAL Visual Object Classes Challenge
95
Instrument-tissue Interaction Quintuple Detection in Surgery Videos
96
VisDrone-MOT2020: The Vision Meets Drone Multiple Object Tracking Challenge Results
97
OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis
98
Action Recognition in Realistic Sports Videos
99
JHU-ISI Gesture and Skill Assessment Working Set ( JIGSAWS ) : A Surgical Activity Dataset for Human Motion Modeling
100
Motif Discovery in OR Sensor Data with Application to Surgical Workflow Analysis and Activity Detection
101
Recognition of the Surgeon's Motions During Endoscopic Operation by Statistics Based Algorithm and Neural Networks Based ANARX Models
102
International Journal of Computer Vision manuscript No. (will be inserted by the editor) The PASCAL Visual Object Classes (VOC) Challenge