
TLDR

The temporal and multi-modal nature of video increases the dimensionality of content-based retrieval and places new demands on indexing and retrieval tools. The Virage Video Engine (VVE) addresses these demands with a framework and a default set of primitives. It is a flexible, platform-independent architecture that processes synchronized image, audio, and closed-caption streams and supports multi-modal indexing and retrieval through media-specific primitives. This paper demonstrates the use of the VVE framework for content-based video retrieval.

Abstract

The temporal and multi-modal nature of video increases the dimensionality of the content-based retrieval problem. This places new demands on the indexing and retrieval tools required. The Virage Video Engine (VVE), with its default set of primitives, provides the necessary framework and basic tools for content-based video retrieval. The video engine is a flexible, platform-independent architecture that supports processing multiple synchronized data streams, such as image sequences, audio, and closed captions. The architecture allows for multi-modal indexing and retrieval of video through the use of media-specific primitives. This paper presents the use of the VVE framework for content-based video retrieval.