Multi-sensor concert recording dataset including professional and user-generated content

Werner Bailer, Chris Pike, Rik Bauwens, Reinhard Grandl, Mike Matton, Marcus Thaler
2015 Proceedings of the 6th ACM Multimedia Systems Conference on - MMSys '15  
We present a novel dataset for multi-view video and spatial audio. An ensemble of ten musicians from the BBC Philharmonic Orchestra performed in the orchestra's rehearsal studio in Salford, UK, on 25th March 2014. This presented a controlled environment in which to capture a dataset that could be used to simulate a large event, whilst allowing control over the conditions and performance. The dataset consists of hundreds of video and audio clips captured during 18 takes of performances, using a
more » ... road range of professionaland consumer-grade equipment, up to 4K video and highend spatial microphones. In addition to the audiovisual essence, sensor metadata has been captured, and ground truth annotations, in particular for temporal synchronization and spatial alignment, have been created. A part of the dataset has also been prepared for adaptive content streaming. The dataset is released under a Creative Commons Attribution Non-Commercial Share Alike license and hosted on a specifically adapted content management platform.
doi:10.1145/2713168.2713191 dblp:conf/mmsys/BailerPBGMT15 fatcat:s2vc4tuorfc7xbxh23wmnupake