Publication | Open Access
Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video
23
Citations
36
References
2021
Year
Convolutional Neural NetworkEngineeringMachine LearningVideo ProcessingSurgeryVideo InterpretationImage Sequence AnalysisImage AnalysisData SciencePattern RecognitionVideo Content AnalysisVideo TransformerRadiologySurgical Instrument SegmentationMachine VisionMedical ImagingFeature LearningDeep Learning-based MethodsComputer ScienceVideo UnderstandingDeep LearningMedical Image ComputingComputer VisionReal-time Instrument SegmentationVideo AnalysisMedicineDeep Feature ExtractionImage SegmentationMotion Analysis
Deep learning-based methods have achieved promising results on surgical instrument segmentation. However, the high computation cost may limit the application of deep models to time-sensitive tasks such as online surgical video analysis for robotic-assisted surgery. Moreover, current methods may still suffer from challenging conditions in surgical images such as various lighting conditions and the presence of blood. We propose a novel Multi-frame Feature Aggregation (MFFA) module to aggregate video frame features temporally and spatially in a recurrent mode. By distributing the computation load of deep feature extraction over sequential frames, we can use a lightweight encoder to reduce the computation costs at each time step. Moreover, public surgical videos usually are not labeled frame by frame, so we develop a method that can randomly synthesize a surgical frame sequence from a single labeled frame to assist network training. We demonstrate that our approach achieves superior performance to corresponding deeper segmentation models on two public surgery datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1