European Projects

Vidi-Video: Interactive semantic video search with a large thesaurus of machine-learned audio-visual concepts

On Thursday 13th, a meeting of the Vidi-Video is scheduled.

VIDI-Video project takes on the challenge of creating a substantially enhanced semantic access to video, implemented in a search engine. The engine will boost the performance of video search by forming a 1000 element thesaurus detecting instances of audio, visual or mixed-media content. Concrete outputs will be a fully implemented audio-visual search engine, consisting of two main parts, viz. a learning system and a runtime system, where the former will feed its results into the latter after each round of training-and-thesaurus-update. The learning system will consist of software to be developed for overall video processing; visual analysis; audio analysis; integrated feature detector; and multimedia query and user interface.

The key objectives of this project are:

  1. to build a large scale thesaurus well-spread over the semantic clues
  2. to design, adapt and evaluate methods to learn large thesauri of detectors
  3. to define and evaluate powerful sets of visual, audio, and cross-modal invariant features
  4. to deliver effective interaction with the user
  5. to evaluate the approach in relevant application areas