Publication | Closed Access
Audio-visual encoding of multimedia content for enhancing movie recommendations
36
Citations
28
References
2018
Year
Unknown Venue
MusicContent DescriptionsEngineeringInformation RetrievalData SciencePattern RecognitionInteractive MultimediaMultimedia AnalysisVideo SummarizationAudio RetrievalMultimedia ContentVarious ModalitiesMultimedia SearchArtsContent AnalysisVideo RetrievalText MiningRank Aggregation Strategy
We propose a multi-modal content-based movie recommender system that replaces human-generated metadata with content descriptions automatically extracted from the visual and audio channels of a video. Content descriptors improve over traditional metadata in terms of both richness (it is possible to extract hundreds of meaningful features covering various modalities) and quality (content features are consistent across different systems and immune to human errors). Our recommender system integrates state-of-the-art aesthetic and deep visual features as well as block-level and i-vector audio features. For fusing the different modalities, we propose a rank aggregation strategy extending the Borda count approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1