Publication | Open Access
Dual Self-Paced Cross-Modal Hashing
26
Citations
42
References
2024
Year
Cmh MethodsEngineeringMachine LearningMultimodal LearningNatural Language ProcessingInformation RetrievalData SciencePattern RecognitionMulti-task LearningPerceptual HashingMachine VisionFeature LearningKnowledge DiscoveryCross-modal Hashing~Hash FunctionMultimodal Signal ProcessingComputer ScienceDeep LearningComputer VisionDual Difficulty Levels
Cross-modal hashing~(CMH) is an efficient technique to retrieve relevant data across different modalities, such as images, texts, and videos, which has attracted more and more attention due to its low storage cost and fast query speed. Although existing CMH methods achieve remarkable processes, almost all of them treat all samples of varying difficulty levels without discrimination, thus leaving them vulnerable to noise or outliers. Based on this observation, we reveal and study dual difficulty levels implied in cross-modal hashing learning, \ie instance-level and feature-level difficulty. To address this problem, we propose a novel Dual Self-Paced Cross-Modal Hashing (DSCMH) that mimics human cognitive learning to learn hashing from ``easy'' to ``hard'' in both instance and feature levels, thereby embracing robustness against noise/outliers. Specifically, our DSCMH assigns weights to each instance and feature to measure their difficulty or reliability, and then uses these weights to automatically filter out the noisy and irrelevant data points in the original space. By gradually increasing the weights during training, our method can focus on more instances and features from ``easy'' to ``hard'' in training, thus mitigating the adverse effects of noise or outliers. Extensive experiments are conducted on three widely-used benchmark datasets to demonstrate the effectiveness and robustness of the proposed DSCMH over 12 state-of-the-art CMH methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1