Publication | Closed Access
Boosting Few-Shot Learning With Adaptive Margin Loss
196
Citations
28
References
2020
Year
Unknown Venue
Artificial IntelligenceFew-shot LearningEngineeringMachine LearningMeta-learningNatural Language ProcessingImage AnalysisZero-shot LearningData SciencePattern RecognitionMulti-task LearningSupervised LearningMachine VisionKnowledge DiscoveryComputer ScienceDeep LearningGeneralization AbilityComputer VisionAdaptive Margin LossSemantic Similarity
Few-shot learning remains challenging because generalizing from very few examples is intrinsically difficult. The study proposes an adaptive margin principle to enhance the generalization of metric-based meta-learning for few-shot learning. They introduce a class-relevant additive margin loss that uses semantic similarity between class pairs to separate samples in the embedding space, and a task-relevant additive margin loss that incorporates semantic context among all classes in a sampled training task, enabling extension to generalized FSL. Experiments show that the adaptive margin method improves performance of metric-based meta-learning in both standard and generalized few-shot learning.
Few-shot learning (FSL) has attracted increasing attention in recent years but remains challenging, due to the intrinsic difficulty in learning to generalize from a few examples. This paper proposes an adaptive margin principle to improve the generalization ability of metric-based meta-learning approaches for few-shot learning problems. Specifically, we first develop a class-relevant additive margin loss, where semantic similarity between each pair of classes is considered to separate samples in the feature embedding space from similar classes. Further, we incorporate the semantic context among all classes in a sampled training task and develop a task-relevant additive margin loss to better distinguish samples from different classes. Our adaptive margin method can be easily extended to a more realistic generalized FSL setting. Extensive experiments demonstrate that the proposed method can boost the performance of current metric-based meta-learning approaches, under both the standard FSL and generalized FSL settings.
| Year | Citations | |
|---|---|---|
Page 1
Page 1