Filters








4,266 Hits in 4.5 sec

A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End

Tobias May, Steven van de Par, Armin Kohlrausch
<span title="">2011</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zcz4ey2iwffxtgaodtf5jtebmy" style="color: black;">IEEE Transactions on Audio, Speech, and Language Processing</a> </i> &nbsp;
Although extensive research has been done in the field of machine-based localization, the degrading effect of reverberation and the presence of multiple sources on localization performance has remained  ...  Multiconditional training is performed to take into account the variability of the binaural features which results from multiple sources and the effect of reverberation.  ...  Park, and the anonymous reviewers for their useful comments that have greatly improved the clarity of this paper.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2010.2042128">doi:10.1109/tasl.2010.2042128</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/y4sk5jwudndhjl6yoyeb6iagxq">fatcat:y4sk5jwudndhjl6yoyeb6iagxq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829071713/http://orbit.dtu.dk/files/53464002/05406118.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c9/c6/c9c69cda789f345171703a5201b8dfbf9acbdbcf.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2010.2042128"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization

Elie Laurent Benaroya, Nicolas Obin, Marco Liuni, Axel Roebel, Wilson Raumel, Sylvain Argentieri
<span title="">2018</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rut5unc4enborm7fhpkwgeza7m" style="color: black;">IEEE/ACM Transactions on Audio Speech and Language Processing</a> </i> &nbsp;
This paper presents non-negative factorization of audio signals for the binaural localization of multiple sound sources within realistic and unknown sound environments.  ...  The proposed NTFbased sound source localization is here applied to binaural sound source localization of multiple speakers within realistic sound environments.  ...  The addition of a noise source model at a fixed position substantially improves the sound source localization.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/taslp.2018.2806745">doi:10.1109/taslp.2018.2806745</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/wbokq2b7brecfnbgbm5rxn2cba">fatcat:wbokq2b7brecfnbgbm5rxn2cba</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200210225416/https://hal.sorbonne-universite.fr/hal-01722004/file/Binaural_Localization_of_Multiple_Sound.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3f/c9/3fc995e2352c1b91bb02f3e156d005751b4ed100.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/taslp.2018.2806745"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Binaural Localization of Multiple Sources in Reverberant and Noisy Environments

John Woodruff, DeLiang Wang
<span title="">2012</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zcz4ey2iwffxtgaodtf5jtebmy" style="color: black;">IEEE Transactions on Audio, Speech, and Language Processing</a> </i> &nbsp;
We also propose a flexible azimuth-dependent model of binaural features that independently captures characteristics of the binaural setup and environmental conditions, allowing for adaptation to new environments  ...  We demonstrate performance improvement relative to binaural only methods assuming a known number of spatially stationary sources.  ...  May for making implementations of their algorithms available, and C. Hummersone for making the set of measured impulse responses available.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2012.2183869">doi:10.1109/tasl.2012.2183869</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ymvqys4d7rfado2ehc6xdfl46q">fatcat:ymvqys4d7rfado2ehc6xdfl46q</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829112958/http://web.cse.ohio-state.edu/~wang.77/papers/Woodruff-Wang.taslp12.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/07/81/0781d05c66619aa5da66457934693b4463b7f348.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2012.2183869"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Robust localization and tracking of multiple speakers in real environments for binaural robot audition

Ui-Hyun Kim, Hiroshi G. Okuno
<span title="">2013</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/o53yzhxhwbemxownsy4lul2sma" style="color: black;">2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)</a> </i> &nbsp;
consisting of voice activity detection (VAD) and K-means clustering algorithm for binaural robot audition.  ...  The standard K-means clustering algorithm was improved for the purpose of multisource speech tracking by adding two additional steps.  ...  Among the various functions required for binaural robot audition, sound source localization (SSL) is one of the most important techniques to achieve more natural and intelligent human-robot interaction  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/wiamis.2013.6616137">doi:10.1109/wiamis.2013.6616137</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/wiamis/KimO13.html">dblp:conf/wiamis/KimO13</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6j7zlbsugva4jb3fjdmw3t7ztm">fatcat:6j7zlbsugva4jb3fjdmw3t7ztm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170705111005/http://winnie.kuis.kyoto-u.ac.jp/members/okuno/Public/WIAMIS2013-Kim.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3a/56/3a5618a2577cdfd2cca5b27f32fefff6aaaa52f4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/wiamis.2013.6616137"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization

John Woodruff, DeLiang Wang
<span title="">2010</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zcz4ey2iwffxtgaodtf5jtebmy" style="color: black;">IEEE Transactions on Audio, Speech, and Language Processing</a> </i> &nbsp;
S'09) received the B.F.A. degree in performing arts and technology and the B.S. degree in mathematics from the University of Michigan, Ann Arbor, in 2002 and 2004, respectively, and the M.Mus. degree in  ...  Pedersen for providing feedback on a preliminary draft of this manuscript.  ...  ACKNOWLEDGMENT The authors would like to thank the three anonymous reviewers for their constructive criticisms and suggestions. The authors would also like to thank M.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2010.2050087">doi:10.1109/tasl.2010.2050087</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/y3w6s4okpfhxfko3gnlxbbr6sm">fatcat:y3w6s4okpfhxfko3gnlxbbr6sm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20110608172845/http://www.cse.ohio-state.edu/~dwang/papers/Woodruff-Wang.taslp10.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d6/9c/d69cb6b6000ebefbd09608d7e4bcf15ca1eb25ee.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tasl.2010.2050087"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

DPLM: A Deep Perceptual Spatial-Audio Localization Metric [article]

Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia
<span title="2021-05-29">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
We model localization similarity by utilizing activation-level distances from deep networks trained for direction of arrival (DOA) estimation.  ...  Specifically, we propose a framework for building a general purpose quality metric to assess spatial localization differences between two binaural recordings.  ...  We begin by building binaural direction-of-arrival (DOA) deep network models that act as surrogates for localization.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2105.14180v1">arXiv:2105.14180v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/yymhrnkqenev3c4f23psdgn6ve">fatcat:yymhrnkqenev3c4f23psdgn6ve</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210602083439/https://arxiv.org/pdf/2105.14180v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a9/2f/a92f36d910d52cbc79b120ba7d2ad093dbf61ffb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2105.14180v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Binaural bearing only tracking of stationary sound sources in reverberant environment

Ingo Kossyk, Michael Neumann, Zoltan-Csaba Marton
<span title="">2015</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p2nw3ufilvbknotfq56xtvnhtq" style="color: black;">2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids)</a> </i> &nbsp;
In this work we present a framework for the estimation of the Cartesian position of stationary sound sources in reverberant environments and under the influence of heavy clutter based on binaural bearing  ...  The feasibility of the presented methods is evaluated in simulations and we give first results of tracking performance when applied to real world binaural localization measurements of a sound source in  ...  Binaural Sound Source Localization In the literature the commonly used model for binaural localization of sound sources relies on the evaluation of the Interaural Time Difference (ITD) and Interaural Intensity  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/humanoids.2015.7363531">doi:10.1109/humanoids.2015.7363531</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/humanoids/KossykNM15.html">dblp:conf/humanoids/KossykNM15</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/xch3ahb7pze4jar3qmgihjqjqi">fatcat:xch3ahb7pze4jar3qmgihjqjqi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190427081916/https://elib.dlr.de/100780/1/Kossyk_Humanoids2015_final.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/94/db/94db3c7a39587810111059c159cd1966a42fa4c0.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/humanoids.2015.7363531"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Hearing in a shoe-box : binaural source position and wall absorption estimation using virtually supervised learning [article]

Saurabh Kataria , Antoine Deleforge
<span title="2017-03-20">2017</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
This paper introduces a new framework for supervised sound source localization referred to as virtually-supervised learning.  ...  An acoustic shoe-box room simulator is used to generate a large number of binaural single-source audio scenes.  ...  DESCRIPTION OF EXPERIMENTAL SETUP The problem of single-source localization in a reverberant room using a binaural receiver is considered.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1609.09747v2">arXiv:1609.09747v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/i4lxa5q7avgynjfakrnyutr3ba">fatcat:i4lxa5q7avgynjfakrnyutr3ba</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20191027143329/https://arxiv.org/pdf/1609.09747v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c3/1e/c31ea25fc716e79ac89870b66fef50463359e6be.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1609.09747v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Hearing in a shoe-box: Binaural source position and wall absorption estimation using virtually supervised learning

Saurabh Kataria, Clement Gaultier, Antoine Deleforge
<span title="">2017</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</a> </i> &nbsp;
This paper introduces a new framework for supervised sound source localization referred to as virtually-supervised learning.  ...  An acoustic shoe-box room simulator is used to generate a large number of binaural single-source audio scenes.  ...  DESCRIPTION OF EXPERIMENTAL SETUP The problem of single-source localization in a reverberant room using a binaural receiver is considered.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2017.7952151">doi:10.1109/icassp.2017.7952151</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/KatariaGD17.html">dblp:conf/icassp/KatariaGD17</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/sq23xoxvyzdf7hqzijpzqmnkga">fatcat:sq23xoxvyzdf7hqzijpzqmnkga</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180724024800/https://hal.inria.fr/hal-01372435v2/document" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/14/8e/148ec1a1e657ca428aad7d506159fb06fd5fd1c4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2017.7952151"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Binaural and Multiple-Microphone Signal Processing Motivated by Auditory Perception

Richard M. Stern, Evandro Gouvea, Chanwoo Kim, Kshitiz Kumar, Hyung-Min Park
<span title="">2008</span> <i title="IEEE"> 2008 Hands-Free Speech Communication and Microphone Arrays </i> &nbsp;
It is well known that binaural processing is very useful for separating incoming sound sources as well as for improving the intelligibility of speech in reverberant environments.  ...  This paper describes and compares a number of ways in which the classic model of interaural cross-correlation proposed by Jeffress, quantified by Colburn, and further elaborated by Blauert, Lindemann,  ...  Some binaural phenomena The human binaural system is remarkable in its ability to localize single and multiple sound sources, to separate and segregate signals coming from multiple directions, and to understand  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/hscma.2008.4538697">doi:10.1109/hscma.2008.4538697</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/clrpdcufgbaq7m7vqk4apw2t64">fatcat:clrpdcufgbaq7m7vqk4apw2t64</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170812223026/http://www.cs.cmu.edu/afs/cs/user/robust/www/Papers/Stern_HSCMA08.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a8/d8/a8d84e42ccebaaf33a7e489eba9c23a8428ae550.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/hscma.2008.4538697"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Comparison of a target-equalization-cancellation approach and a localization approach to source separation

Jing Mi, Matti Groll, H. Steven Colburn
<span title="">2017</span> <i title="Acoustical Society of America (ASA)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/hwn3tbm3t5cpnjcflcmmwnprc4" style="color: black;">Journal of the Acoustical Society of America</a> </i> &nbsp;
Interaural differences are important for listeners to be able to maintain focus on a sound source of interest in the presence of multiple sources.  ...  In this paper, a different type of binaural cue for source-separation purposes is proposed.  ...  source of interest in the presence of multiple sources.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1121/1.5009763">doi:10.1121/1.5009763</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/29195469">pmid:29195469</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC5685812/">pmcid:PMC5685812</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/e36wkgvgtbfahfahzrw7vbianm">fatcat:e36wkgvgtbfahfahzrw7vbianm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20191121121631/http://europepmc.org/backend/ptpmcrender.fcgi?accid=PMC5685812&amp;blobtype=pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d6/7a/d67aa6fc2a41c7873c535068392fd5461641ad36.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1121/1.5009763"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5685812" title="pubmed link"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> pubmed.gov </button> </a>

SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation [article]

Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi
<span title="2020-11-14">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In addition, our approach effectively preserves the interaural cues, which improves the accuracy of sound localization.  ...  We develop an end-to-end multiple-input multiple-output system, which directly maps from the binaural waveform of the mixture to those of the speech signals.  ...  We apply a binaural sound localization algorithm [24] to the binaural estimates, of which an open-source implementation is available.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2009.01381v2">arXiv:2009.01381v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/2lx7uy2n3jh67j7hs4aojcvhgm">fatcat:2lx7uy2n3jh67j7hs4aojcvhgm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201118004330/https://arxiv.org/pdf/2009.01381v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3e/b3/3eb316bce668c44df830f2b529c8ad8c14bbffe6.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2009.01381v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Low-bandwidth binaural beamforming

S. Srinivasan
<span title="">2008</span> <i title="Institution of Engineering and Technology (IET)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/5njumqonprdi3jwzrhnh22m7pm" style="color: black;">Electronics Letters</a> </i> &nbsp;
For speech sources with a 8 kHz bandwidth in the presence of an interfering source, it is shown that good performance can be achieved with a cutoff frequency of 4 kHz.  ...  An efficient beamforming scheme for wireless binaural hearing aids is proposed that provides a trade-off between the transmission bit rate and the amount of noise reduction.  ...  Fig. 1 1 Improvement in SINR averaged over 0 -8 kHz for completely monaural (dash-dot), completely binaural (dashed), and proposed scheme (solid) for different locations of the interferer; desired source  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1049/el:20082021">doi:10.1049/el:20082021</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qwuouxzv25frfdknljvg3dg3zq">fatcat:qwuouxzv25frfdknljvg3dg3zq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180723150445/https://pure.tue.nl/ws/files/3073433/Metis248244.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/9b/8b/9b8b5ad9cd192f660a4e5b28279a0596224960f5.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1049/el:20082021"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization

Lulu Wu, Hong Liu, Bing Yang, Runwei Ding, Zhile Yang
<span title="2020-12-30">2020</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/y3fh56bfunh5fgneywwba6d4ke" style="color: black;">Complexity</a> </i> &nbsp;
Most binaural speech source localization models perform poorly in unprecedentedly noisy and reverberant situations.  ...  The multiscale dilated CNN can encode discriminative representations for CCF and ILD, respectively. After encoding, the individual interaural representations are fused to map source direction.  ...  Acknowledgments is work was supported by National Natural Science Foundation of China (nos. 61673030 and U1613209) and National Natural Science Foundation of Shenzhen (no. JCYJ20190808182209321).  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2020/5819624">doi:10.1155/2020/5819624</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6wemeixhfvff3apyedbvhjvhtm">fatcat:6wemeixhfvff3apyedbvhjvhtm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210225063530/https://downloads.hindawi.com/journals/complexity/2020/5819624.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f6/79/f6798574896ace853c8e97ea6fb10b59c6683d64.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2020/5819624"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> hindawi.com </button> </a>

Binaural Sound Source Localization Based on Convolutional Neural Network

Lin Zhou, Kangyu Ma, Lijie Wang, Ying Chen, Yibin Tang
<span title="">2019</span> <i title="Computers, Materials and Continua (Tech Science Press)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/amujz7fcqna6do727z6ev3ueo4" style="color: black;">Computers Materials &amp; Continua</a> </i> &nbsp;
Binaural sound source localization (BSSL) in low signal-to-noise ratio (SNR) and high reverberation environment is still a challenging task.  ...  The CNN is then used to predict azimuth of sound source.  ...  [May, Van de Par and Kohlrausch (2011) ] divided the acoustic signal into multiple subbands and proposed the Gaussian mixed model (GMM) to model the binaural cues. Xiao et al.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.32604/cmc.2019.05969">doi:10.32604/cmc.2019.05969</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/rplbtxln2bh3dms43zl4sbjhkq">fatcat:rplbtxln2bh3dms43zl4sbjhkq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200505130928/http://tsp.techscience.com//uploads/attached/file/20190725/20190725073239_12293.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/97/94/97943c6616914080a4aa72f1da380c0fb308dcb9.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.32604/cmc.2019.05969"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 4,266 results