Filters








1,138 Hits in 2.3 sec

Low-Latency Speaker-Independent Continuous Speech Separation [article]

Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis
2019 arXiv   pre-print
architecture combined with a double buffering scheme and (2) by performing enhancement with a set of fixed beamformers followed by a neural post-filter.  ...  The previous SI-CSS method uses a neural network trained with permutation invariant training and a data-driven beamformer and thus requires much processing latency.  ...  The proposed enhancement scheme, combining the fixed beamformers with the post-filter, was less sensitive to the degradation in the TF mask quality.  ... 
arXiv:1904.06478v1 fatcat:wjozo4mthvel7bpwuuoiw6y4ny

Low-latency Speaker-independent Continuous Speech Separation

Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis
2019 ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
This is achieved (1) by using a new speech separation network architecture combined with a double buffering scheme and (2) by performing enhancement with a set of fixed beamformers followed by a neural  ...  The previous SI-CSS method uses a neural network trained with permutation invariant training and a data-driven beamformer and thus requires much processing latency.  ...  The proposed enhancement scheme, combining the fixed beamformers with the post-filter, was less sensitive to the degradation in the TF mask quality.  ... 
doi:10.1109/icassp.2019.8682274 dblp:conf/icassp/YoshiokaCLXED19 fatcat:nvx2cfpbkrfr3jm2fv4f3v6ec4

Towards Fast Region Adaptive Ultrasound Beamformer for Plane Wave Imaging Using Convolutional Neural Networks [article]

Roshan P Mathews, Mahesh Raveendranatha Panicker
2021 arXiv   pre-print
The CNN architecture could leverage the spatial information and will be more region adaptive during the beamforming process.  ...  A comprehensive performance comparison in terms of qualitative and quantitative measures for fully connected neural network (FCNN), the proposed CNN architecture, MVDR and Delay and Sum (DAS) using the  ...  CONCLUSION This paper demonstrates the usefulness of an improvised CNN based beamforming method for ultrasound image quality enhancements in terms of contrast and resolution.  ... 
arXiv:2106.07006v2 fatcat:aclvijwpqzbxhbor7hna5ktiua

VMInNet: Interpolation of Virtual Microphones in Optimal Latent Space Explored by Autoencoder

Riki Takahashi, Li Li, Shoji Makino, Takeshi Yamada
2021 Journal of Signal Processing  
with high performance.  ...  This technique has been shown to be effective in improving the speech enhancement performance of beamforming in underdetermined situations.  ...  We use two-dimensional fully convolutional neural networks (CNNs) to design the autoencoder, which allows the network to handle inputs with arbitrary length and perform interpolation by taking the whole  ... 
doi:10.2299/jsp.25.245 fatcat:ao4ol5d2zbgbza6ebs22ccrram

SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing [article]

R G Prithvi Raj, Rohit Kumar, M K Jayesh, Anurenjan Purushothaman, Sriram Ganapathy, M A Basha Shaik
2021 arXiv   pre-print
The single channel speech enhancement is done in log spectral domain using convolution neural network (CNN)-long short term memory (LSTM) based architecture.  ...  We propose a two stage method involving a beamformer followed by single channel enhancement.  ...  This proposed model involving A-FaSNet based beamforming with the single channel enhancement pipeline also fares well when comparing with the enhanced baseline.  ... 
arXiv:2106.12763v1 fatcat:of4t4janf5d3besjpl2467x4nq

A Computer Vision Based Beamforming Scheme for Millimeter Wave Communication in LOS Scenarios [article]

Tianqi Xiang, Yaxin Wang, Huiwen Li, Boren Guo, Xin Zhang
2020 arXiv   pre-print
A wireless coverage model is built to investigate the coverage performance and influence of positioning accuracy achieved by convolutional neural network (CNN) for image processing.  ...  It is proved by simulations that the beamforming scheme is practicable and the mainstream CNN we employed is sufficient in both aspects of beam directivity accuracy and processing speed in frame per second  ...  Location-aware communication aims to utilize the location information for enhancing the usage of spatial dimension including beamforming. 5G dense network has been proposed for learning location information  ... 
arXiv:2006.11566v1 fatcat:jjtvoot6yvfkpj34arrfwznuju

3-D Feature and Acoustic Modeling for Far-Field Speech Recognition [article]

Anurenjan Purushothaman, Anirudh Sreeram, Sriram Ganapathy
2020 arXiv   pre-print
The 3-D CNN architecture allows the combination of multi-channel features that optimize the speech recognition cost compared to the traditional beamforming models that focus on the enhancement task.  ...  The MAR features are fed to a convolutional neural network (CNN) architecture which performs the joint acoustic modeling on the three dimensions.  ...  In order to enhance the learning of the non linearity in the filters of the 3-D CNN layer, we use the Network in Network (NIN) [38] architecture.  ... 
arXiv:1911.05504v2 fatcat:c66cew573vc7vng2hyhmfbuh4e

Hybrid Beamforming for MISO System via Convolutional Neural Network

Teng Zhang, Anming Dong, Chuanting Zhang, Jiguo Yu, Jing Qiu, Sufang Li, You Zhou
2022 Electronics  
In this work, a low-complexity convolutional neural network (CNN)-based HBF algorithm is proposed to solve the SE maximization problem under the constant modulus constraint and transmit power constraint  ...  Hybrid beamforming (HBF) is a promising approach to obtain a better balance between hardware complexity and system performance in massive MIMO communication systems.  ...  Further, [18] solved three beamforming optimization problems using DL to enhance HBF performance. All the papers mentioned above employ deep supervised learning to train the network.  ... 
doi:10.3390/electronics11142213 fatcat:y2xxlvvfijeyvengc3w3hrcfqy

Multi-Channel Automatic Speech Recognition Using Deep Complex Unet [article]

Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, Lei Xie
2020 arXiv   pre-print
It also achieves superior performance than the recently proposed neural beamforming method.  ...  The front-end module in multi-channel automatic speech recognition (ASR) systems mainly use microphone array techniques to produce enhanced signals in noisy conditions with reverberation and echos.  ...  The acoustic model used in the back-end contains a 2-layer CNNs and a 12-layer time delay neural networks (TDNN).  ... 
arXiv:2011.09081v1 fatcat:ihsuw26r4nc2teoiyf45ovqeoi

End-to-End Multi-Channel Speech Enhancement Using Inter-Channel Time-Restricted Attention on Raw Waveform

Hyeonseung Lee, Hyung Yong Kim, Woo Hyun Kang, Jeunghun Kim, Nam Soo Kim
2019 Interspeech 2019  
The model first extracts sample-level speech embedding using channel-wise convolutional neural network (CNN) and compensates time-delays between the channels based on the embedding, resulting in timealigned  ...  This paper describes a novel waveform-level end-to-end model for multi-channel speech enhancement.  ...  Although these methods introduced endto-end filter prediction networks, the networks were optimized with back-end acoustic model only to minimize ASR costs, not speech enhancement losses.  ... 
doi:10.21437/interspeech.2019-2397 dblp:conf/interspeech/LeeKKKK19 fatcat:jn4uba3kobhmtknojxjktg3yoi

Speech Enhancement Based on Beamforming and Post-Filtering by Combining Phase Information

Rui Cheng, Changchun Bao
2020 Interspeech 2020  
In this paper, a multi-channel speech enhancement method is proposed, which combines beamforming and post-filtering based on neural network.  ...  With the development of microphone array signal processing technology and deep learning, the beamforming combined with neural network has provided a more diverse solution for this field.  ...  In recent years, traditional beamforming methods have been combined with deep neural networks (DNNs) to improve the performance of speech enhancement.  ... 
doi:10.21437/interspeech.2020-0990 dblp:conf/interspeech/ChengB20 fatcat:3hjxi5b4indyheyv7ly6ayxzzy

Joint Antenna Selection and Hybrid Beamformer Design using Unquantized and Quantized Deep Learning Networks [article]

Ahmet M. Elbir, Kumar Vijay Mishra
2019 arXiv   pre-print
We formulate antenna selection and hybrid beamformer design as a classification/prediction problem for convolutional neural networks (CNNs).  ...  Further investigations with quantized-CNNs show that the proposed network, saved in no more than 5 bits, is also suited for digital mobile devices.  ...  The second CNN (CNNRF) takes the input of the subsequent channel matrix with selected rows to choose RF beamformers.  ... 
arXiv:1905.03107v2 fatcat:j52rwhk2rzcevk2ofbrwrqwam4

Technology trends and applications of deep learning in ultrasonography: image quality enhancement, diagnostic support, and improving workflow efficiency

Jonghyon Yi, Ho Kyung Kang, Jae-Hyun Kwon, Kang-Sik Kim, Moon Ho Park, Yeong Kyeong Seong, DongWoo Kim, ByungEun Ahn, Kilsu Ha, Jinyong Lee, Zaegyoo Hah, Won-Chul Bang
2020 Ultrasonography  
Efforts towards workflow enhancement are also reviewed, with an emphasis on view recognition, scanning guide, image quality assessment, and quantification and measurement.  ...  Finally some future prospects are presented regarding image quality enhancement, diagnostic support, and improvements in workflow efficiency, along with remarks on hurdles, benefits, and necessary collaborations  ...  The FUIQA system was implemented with two deep CNN models, L-CNN and C-CNN.  ... 
doi:10.14366/usg.20102 pmid:33152846 pmcid:PMC7758107 fatcat:7twnmbinjve7parztuzc3maes4

Intelligent Radio Signal Processing: A Survey [article]

Quoc-Viet Pham and Nhan Thanh Nguyen and Thien Huynh-The and Long Bao Le and Kyungchun Lee and Won-Joo Hwang
2021 arXiv   pre-print
In particular, each theme is presented in a dedicated section, starting with the most fundamental principles, followed by a review of up-to-date studies and a summary.  ...  Intelligent signal processing for wireless communications is a vital task in modern wireless systems, but it faces new challenges because of network heterogeneity, diverse service requirements, a massive  ...  A similar method to solve the SRM problem was used by proposing a CNN-based beamforming prediction network (BPNet) [148] .  ... 
arXiv:2008.08264v3 fatcat:4wmxyio6ejfvbfnqodq5z426m4

Neural networks for distant speech recognition

Steve Renals, Pawel Swietojanski
2014 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)  
We performed experiments on the AMI and ICSI meeting corpora, with results indicating that neural network models are capable of significant improvements in accuracy compared with discriminatively trained  ...  In particular we investigate the use of convolutional and fully-connected neural networks with different activation functions (sigmoid, rectified linear, and maxout).  ...  2 . 2 WER (%) on AMI and ICSI -MDM with beamforming System AMI ICSI BMMI GMM-HMM (LDA+STC) 54.8 46.8 DNN -Sigmoid 49.5 41.0 DNN -ReLU 46.3 38.7 DNN -Maxout 46.4 39.0 CNN -Sigmoid 46.3 39.5 CNN  ... 
doi:10.1109/hscma.2014.6843274 dblp:conf/hscma/RenalsS14 fatcat:zfjpq5de7fdzvbqc4p7vcng72m
« Previous Showing results 1 — 15 out of 1,138 results