MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

Julio Villena-Román, Sara Lana-Serrano
2008 Conference and Labs of the Evaluation Forum  
This paper describes the participation of MIRACLE research consortium at the VideoCLEF track at CLEF 2008. We took part in both the main mandatory Classification task that consists in classifying videos of television episodes using speech transcripts and metadata, and the Keyframe Extraction task, whose objective is to select keyframes that represent individual episodes from a set of supplied keyframes (one from each shot of the video source). For the first task, our system is composed of two
more » ... in blocks, the first in charge of building the core system knowledge base, and then the set of operational elements that are needed to classify the speech transcripts of the topic episodes and generate the output in RSS format. For the second task, our approach is based on the assumption that the most representative fragment (shot) of each episode is the one whose distance to the whole episode is the lowest, considering a vector space model. 4 runs were submitted in all. Regarding the classification task, we ranked 3 rd (out of 6 participants) in terms of precision and 2 nd in terms of recall.
dblp:conf/clef/Villena-RomanL08a fatcat:q7igqx3i4nghlpbfi2b76eicye