Publication | Closed Access
Cross-Modal Subspace Learning via Pairwise Constraints
76
Citations
59
References
2015
Year
Pairwise ConstraintEngineeringMachine LearningMultimodal LearningVideo RetrievalNatural Language ProcessingImage AnalysisInformation RetrievalData ScienceText-to-image RetrievalPattern RecognitionLow-rank ApproximationManifold LearningDifferent ModalitiesComputer ScienceImage SimilarityDeep LearningComputer VisionCompound ℓ21 RegularizationPairwise Constraints
In multimedia applications, the text and image components in a web document form a pairwise constraint that potentially indicates the same semantic concept. This paper studies cross-modal learning via the pairwise constraint and aims to find the common structure hidden in different modalities. We first propose a compound regularization framework to address the pairwise constraint, which can be used as a general platform for developing cross-modal algorithms. For unsupervised learning, we propose a multi-modal subspace clustering method to learn a common structure for different modalities. For supervised learning, to reduce the semantic gap and the outliers in pairwise constraints, we propose a cross-modal matching method based on compound ℓ21 regularization. Extensive experiments demonstrate the benefits of joint text and image modeling with semantically induced pairwise constraints, and they show that the proposed cross-modal methods can further reduce the semantic gap between different modalities and improve the clustering/matching accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1