A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020.
The file type is application/pdf.
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
[article]
2020
arXiv
pre-print
Stereophonic audio is an indispensable ingredient for enhancing the human auditory experience. Recent research has explored using visual information as guidance to generate binaural or ambisonic audio from mono audio with stereo supervision. However, this fully supervised paradigm suffers from an inherent drawback: recording stereophonic audio usually requires delicate devices that are too expensive to be widely accessible. To overcome this challenge, we propose to leverage the vastly
arXiv:2007.09902v1
fatcat:ucqdbcdwynhgjg6eqdlwyivip4