A reduced yet extensible audio-visual description language

Rapha�l Troncy, Jean Carrive
2004 Proceedings of the 2004 ACM symposium on Document engineering - DocEng '04  
Enabling an intelligent access to multimedia data requires a powerful description language. In this paper, we demonstrate why the MPEG-7 standard fails to fulfill this task. We introduce then our proposition: an audio-visual specific description language, modular, reduced, but designed to be extensible. This language is centered on the notions of descriptor and structure with a well-defined semantics. A descriptor can be a low-level feature, automatically extracted from the signal, or a higher
more » ... emantic concept that will be used to annotate the video documents. The descriptors can be combined into structures according to defined models that provide description patterns.
doi:10.1145/1030397.1030415 dblp:conf/doceng/TroncyC04 fatcat:dsex64gm65cq7cm665jvf7tu5a