Automated Metadata Enrichment of Large Speech Radio Archives

Y. Raimond, C. Lowis, R. Hodgson, D. Tinley
2014 SMPTE Motion Imaging Journal  
The BBC is currently tagging programmes manually, using DBpedia as a source of tag identifiers, and a list of suggested tags extracted from their synopsis. These tags are then used to help navigation and topic-based search of BBC programmes. However, given the very large number of programmes available in the archive, most of them having very little metadata attached to them, we need a way of automatically assigning tags to programmes. We describe a framework to do so, using speech recognition,
more » ... ext processing and concept tagging techniques. We evaluate this framework against manually applied tags, and compare it with related work. We find that this framework is good enough to bootstrap the interlinking process of archived content.
doi:10.5594/j18370xy fatcat:g5dql24le5c5noeifruiikiyoe