A/V Analyzing Toolbox

Fraunhofer Institute for Digital Media Technology

© istock.com/Anthony Brown

Automated Audio and Video Analysis

A/V Analyzing Toolbox is a modular content analysis framework which allows comprehensive automated processes to be applied to audio-visual content for the optimization of broadcasting, distribution and archiving workflows. It inspects A/V data with respect to technical and perceptual parameters and extracts metadata, as well as semantic information. A/V Analyzing Toolbox provides components for automated detection of errors and quality issues in A/V content. The video analysis components enable e.g. detailed temporal video segmentation and motion analysis for browsing, management, and synchronization of content.


A/V Error Detection Libraries

The QC components enable the detection of errors and quality issues in A/V content for automatic quality control.

Temporal Video Segmentation

The Temporal Video Segmentation detects and compiles shots, representative key frames, and scenes in movies or movie clips for browsing, exploration, management, and content presentation.

Video Motion Analysis

The Video Motion Analysis detects motion in videos, e.g. pan, tilt, zoom, or camera unsteadiness. Based on motion detection, further video analysis approaches are applied, such as visual rhythm pattern analysis, e.g. inner rhythm classification: fluid, staccato, or no motion.

Semantic Video Analysis

The semantic video analysis enables automated tagging of video segments through the classification of visual concepts or classes ranging from time of day, landscape (e.g. beach or forest) and mood. The system can be trained to analyze different concepts or class types based upon the unique needs of the customer. The semantic video analysis is also capable of generating recommendations for different concepts based upon the customer’s use. The functionality of the software is helpful for browsing or searching in large video archives.

Speech/Music Discrimination

The automatic Speech/Music Discrimination segments audio content with respect to containing speech, music, speech and background music, and silence, which can be used for automatic music royalty reporting, cue sheet generation, content search, and as a preparatory step to improve robust content identification.

Technical Information

  • Universally applicable signal analysis framework for A/V content
  • Analysis methods encapsulated in dedicated modules
  • Modular architecture supporting customization to individual requirements
  • Components available as C++ libraries for Windows, Linux and OS X for integration into existing systems
  • Evaluation versions available upon request

The A/V Analyzing Toolbox is being continuously enhanced. If you have specific requirements, or would like more information please contact us.

Social Bookmarks