Publication | Closed Access
Learning Multiscale Deep Features and SVM Regressors for Adaptive RGB-T Saliency Detection
26
Citations
10
References
2017
Year
Unknown Venue
Convolutional Neural NetworkEngineeringFeature DetectionMachine LearningRgb-t Saliency DetectionImage ClassificationImage AnalysisData SciencePattern RecognitionSvm RegressorsImagenet DatasetVideo TransformerVision RecognitionMachine VisionObject DetectionVision Language ModelDeep LearningDifferent Modality InputsComputer VisionScene UnderstandingMultiscale Deep Features
This paper investigates how to perform robust image saliency detection by adaptively leveraging different source data. Given the aligned RGB-T image pair, we learn the robust representations for each modality by using deep convolutional neural networks (CNNs) at different scales, which can capture multiscale context features and rich semantic information inherited from the previous CNNs trained on the ImageNet Dataset. Then, we employ fully connected neural network layer to concatenate multiscale CNN features, and infer the saliency map for each modality. For adaptively incorporating the information from RGB and thermal images, we train a SVM regressor on the multiscale CNN features to compute the reliability weight of each modality, and combine them with the corresponding saliency maps to achieve the fused saliency map. In addition, we create a new image dataset and implement some baseline methods with different modality inputs for facilitating the evaluations of RGB-T saliency detection. Experimental results on the newly created dataset demonstrate the effectiveness of the proposed approach against other baseline methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1