General algorithms for estimating spectrogram and transfer functions of target signal for blind suppression of diffuse noise
2013
2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
We propose two algorithms for jointly estimating the power spectrogram and the room transfer functions of a target signal in diffuse noise. ...
These estimates can be used to design a multichannel Wiener filter, and thereby separate a target signal from an unknown direction from diffuse noise. ...
Experimental results
CONCLUSION We proposed two algorithms for joint estimation of the power spectrogram and the room transfer functions of the target signal for blind suppression of diffuse noise. ...
doi:10.1109/mlsp.2013.6661984
dblp:conf/mlsp/ItoVOS13
fatcat:tfu7doejoreohbq2itfk7tzq5u
Blind speech extraction for Non-Audible Murmur speech with speaker's movement noise
2012
2012 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)
In this paper, we aim to achieve further improvement in the noise reduction ability by changing the noise estimation and postprocessing algorithms to enhance the target NAM signal. ...
In order to reduce the noise signal, blind noise reduction using stereo NAM signals detected with two NAM microphones has been proposed by some of the authors. ...
ACKNOWLEDGMENT This work was supported by the MIC SCOPE, and JST Core Research of Evolution Science and Technology (CREST), Japan. ...
doi:10.1109/isspit.2012.6621308
dblp:conf/isspit/ItoiMTSS12
fatcat:7pkkm6vax5cgtpw3hcvpw4uggm
A blind source separation framework for ego-noise reduction on multi-rotor drones
2020
IEEE/ACM Transactions on Audio Speech and Language Processing
The pre-alignment not only improves the performance of clustering and permutation alignment, but also solves the target-channel selection problem for BSS. ...
To address this problem, we propose a blind source separation (BSS) framework that extracts a target sound from noisy multi-channel signals captured by a microphone array mounted on a drone. ...
Blind source separation (BSS) performs sound enhancement by treating the target and noise signals equally and by separating the sources from the mixed signals captured by the array of microphones [18] ...
doi:10.1109/taslp.2020.3015027
fatcat:hdg3gj3zwfdidkhyk4ow4sv4qm
A Neural Beamspace-Domain Filter for Real-Time Multi-Channel Speech Enhancement
2022
Symmetry
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY ...
function between the sound source and microphones of the array, to generate multi-channel pairs for experiments. ...
the noise component of each pre-generated beam and fuse them. ...
doi:10.3390/sym14061081
dblp:journals/symmetry/LiuLWYCZL22
fatcat:4kxd4job5ndwrplygqvw7f3ii4
Multi-talker speech recognition under ego-motion noise using Missing Feature Theory
2010
2010 IEEE/RSJ International Conference on Intelligent Robots and Systems
Since typical solutions to (1) and (2), motor noise suppression and sound source separation, both introduce distortion to the processed signals, the performance of automatic speech recognition (ASR) deteriorates ...
For this purpose, we model masks that filter unreliable speech features based on the ratio of speech and motor noise energies. ...
It is based on a hybrid algorithm that exerts Blind Source Separation (BSS) [14] and beamforming. ...
doi:10.1109/iros.2010.5650112
dblp:conf/iros/InceNRTI10
fatcat:v334qyuxojakfnlwm47rmtxf7e
Acoustic Self-Awareness of Autonomous Systems in a World of Sounds
2020
Proceedings of the IEEE
Not only generic methods for robust source localization and signal extraction but also specific models and estimation methods for ego-noise based on various learning techniques are discussed. ...
As a first step, the state of the art of relevant generic techniques for acoustic scene analysis (ASA) is reviewed, i.e., source localization and the various facets of signal enhancement, including spatial ...
The authors propose a frequency-domain semi-blind source separation algorithm to estimate the noise signals and obtain an enhanced desired signal by applying an MWF. ...
doi:10.1109/jproc.2020.2977372
fatcat:immaqhfnkna6xdwj3dqlh7qewi
2020 Index IEEE Signal Processing Letters Vol. 27
2020
IEEE Signal Processing Letters
., +, LSP 2020 885-889
E
Echo suppression
A Robust Affine Projection Algorithm Against Impulsive Noise. ...
, W., A Tunable Detector for Distributed Target Detection in the Situation of Signal Mismatch; LSP 2020 151-155 Tang, W., Jiang, H., and Zhang, Q., Range-Angle Decoupling and Estimation for FDA-MIMO Radar ...
doi:10.1109/lsp.2021.3055468
fatcat:wfdtkv6fmngihjdqultujzv4by
2020 Index IEEE/ACM Transactions on Audio, Speech, and Language Processing Vol. 28
2020
IEEE/ACM Transactions on Audio Speech and Language Processing
., +, TASLP 2020 77-91
Perceptually-Transparent Online Estimation of Two-Channel Room Transfer Function for Sound Calibration. ...
., +, TASLP 2020 92-104
Perceptually-Transparent Online Estimation of Two-Channel Room Transfer Function for Sound Calibration. ...
T Target tracking Multi-Hypothesis Square-Root Cubature Kalman Particle Filter for Speaker Tracking in Noisy and Reverberant Environments. Zhang, Q., +, TASLP 2020 1183-1197 ...
doi:10.1109/taslp.2021.3055391
fatcat:7vmstynfqvaprgz6qy3ekinkt4
Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function
2018
IEEE/ACM Transactions on Audio Speech and Language Processing
Fig. 3 depicts the spectrogram examples of the proposed method for both noise-free and noisy signal. ...
Probabilistic techniques use the expectation-maximization (EM) algorithm to maximize the likelihood of a generative model of the noisy microphone signals, such as [12], [13] using relative early transfer ...
doi:10.1109/taslp.2018.2839362
fatcat:afl2zvzgtzddpnvxspioj2wsnu
A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones
2018
Speech Communication
The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target ...
The approach combines signal processing techniques in blind source separation and localisation, with an intrusive objective intelligibility measure (OIM). ...
Acknowledgments This work was supported by the EPSRC Programme Grant S3A: Future Spatial Audio for an Immersive Listener Experience at Home (EP/L000539/1) and the BBC as part of the BBC Audio Research ...
doi:10.1016/j.specom.2017.12.005
fatcat:s6hdi45m6ba2hjilpjlsetk5eu
Table of Contents
2020
IEEE Signal Processing Letters
Jia, and X. Fan 1220 Estimating the Number of Sinusoids in Additive Sub-Gaussian Noise With Finite Measurements ... H. ...
M. de Faria 965 Global Optimization for Recovery of Clipped Signals Corrupted With Poisson-Gaussian Noise ...
Jeong 1530 Nyström Kernel Algorithm Under Generalized Maximum Correntropy Criterion ... T. Zhang and S. ...
doi:10.1109/lsp.2020.3040840
fatcat:ezrfzwo6tjbkfhohq2tgec4m6y
Evaluating Source Separation Algorithms With Reverberant Speech
2010
IEEE Transactions on Audio, Speech, and Language Processing
echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. ...
In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. ...
The three signals of interest are then defined as
Fig. 6. BSS_EVAL evaluation of ground truth and algorithmic masking systems as a function of target-to-masker ratio. ...
doi:10.1109/tasl.2010.2052252
fatcat:4sydiw3d2vbf3hj4jjrxcnfaoq
Techniques to Obtain Good Resolution and Concentrated Time-Frequency Distributions: A Review
2009
EURASIP Journal on Advances in Signal Processing
We present a review of the diversity of concepts and motivations for improving the concentration and resolution of time-frequency distributions (TFDs) along the individual components of the multi-component ...
The objective is the precise description of spectral content of a signal with respect to time, so that first, necessary mathematical and physical principles may be developed, and second, accurate understanding ...
It is driven by zero-mean stationary white noise $e[n]$ so that $x[n] = \sum_{m=-\infty}^{n} h[n, m]\, e[m]$, $H(n, \omega) = \sum_{m=-\infty}^{n} h[n, m]\, e^{-i\omega(n-m)}$ (3) is Zadeh's generalized transfer function (GTF) of the system evaluated ...
doi:10.1155/2009/673539
fatcat:zlbsdxxxm5hp5cnwbx3yl73fvi
Visual Acoustic Matching
[article]
2022
arXiv
pre-print
Given an image of the target environment and a waveform for the source audio, the goal is to re-synthesize the audio to match the target room acoustics as suggested by its visible geometry and materials ...
To address this novel task, we propose a cross-modal transformer model that uses audio-visual attention to inject visual properties into the audio and generate realistic audio output. ...
Acknowledgements UT Austin is supported in part by a gift from Google and the IFML NSF AI Institute. ...
arXiv:2202.06875v2
fatcat:qt6he2ckazgaxp6chln2m73kwa
Deep Ad-hoc Beamforming
[article]
2021
arXiv
pre-print
We have developed many implementations of the proposed framework and conducted an extensive experiment in scenarios where the locations of the speech sources are far-field, random, and blind to the microphones ...
Results on speech enhancement tasks show that our method outperforms its counterpart that works with linear microphone arrays by a considerable margin in both diffuse noise reverberant environments and ...
DeLiang Wang for helpful discussions. ...
arXiv:1811.01233v7
fatcat:rajjon4hgvbwvn5fetdnixd22e