The file type is application/pdf.
Challenges of releasing audio material for spoken data: The case of the London-Lund Corpus 2
2021
Research in Corpus Linguistics
This article describes key challenges of preparing and releasing audio material for spoken data and proposes solutions to them. We draw on our experience of compiling the new London-Lund Corpus 2 (LLC-2), where transcripts are released together with the audio files. Making the audio material publicly available required careful consideration of how best to (1) align the transcripts with the audio and (2) anonymise personal information in the recordings.
doi:10.32714/ricl.09.01.04
fatcat:fr5fmmerujdwrhg3xon3kgkshi