A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework
2019
Interspeech 2019
Speech coding is the most commonly used application of speech processing. Accumulated layers of improvements have however made codecs so complex that optimization of individual modules becomes increasingly difficult. This work introduces machine learning methodology to speech and audio coding, such that we can optimize quality in terms of overall entropy. We can then use conventional quantization, coding and perceptual models without modification such that the codec adheres to conventional
doi:10.21437/interspeech.2019-1284
dblp:conf/interspeech/Backstrom19
fatcat:6ciiqymzungj5n7lzqtrdldovm