Publication | Closed Access
Semi-Supervised Random Forests
132
Citations
18
References
2009
Year
Unknown Venue
Artificial IntelligenceMultiple Instance LearningEngineeringMachine LearningImage AnalysisData ScienceData MiningPattern RecognitionDecision Tree LearningSemi-supervised Random ForestsSemi-supervised LearningSupervised LearningMulti-class MarginMachine VisionPredictive AnalyticsKnowledge DiscoveryComputer ScienceDeep LearningMedical Image ComputingComputer VisionRandom ForestsClassifier System
Random Forests (RFs) have become commonplace in many computer vision applications. Their popularity is mainly driven by their high computational efficiency during both training and evaluation while still being able to achieve state-of-the-art accuracy. This work extends the usage of Random Forests to Semi-Supervised Learning (SSL) problems. We show that traditional decision trees are optimizing multi-class margin maximizing loss functions. From this intuition, we develop a novel multi-class margin definition for the unlabeled data, and an iterative deterministic annealing-style training algorithm maximizing both the multi-class margin of labeled and unlabeled samples. In particular, this allows us to use the predicted labels of the unlabeled data as additional optimization variables. Furthermore, we propose a control mechanism based on the out-of-bag error, which prevents the algorithm from degradation if the unlabeled data is not useful for the task. Our experiments demonstrate state-of-the-art semi-supervised learning performance in typical machine learning problems and constant improvement using unlabeled data for the Caltech-101 object categorization task.
| Year | Citations | |
|---|---|---|
Page 1
Page 1