Filters








9 Hits in 3.2 sec

R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection

Chieh-Chi Kao, Weiran Wang, Ming Sun, Chao Wang
2018 Interspeech 2018  
This paper proposes a Region-based Convolutional Recurrent Neural Network (R-CRNN) for audio event detection (AED).  ...  The proposed network is inspired by Faster-RCNN [1], a wellknown region-based convolutional network framework for visual object detection.  ...  Conclusions A Region-based Convolutional Recurrent Neural Network is proposed for AED.  ... 
doi:10.21437/interspeech.2018-2323 dblp:conf/interspeech/KaoWSW18 fatcat:x6tiaje43rfvhmf4kxxmhfzluu

MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network For Sound Event Detection

Keming Zhang, Yuanwen Cai, Yuan Ren, Ruida Ye, Liang He
2020 IEEE Access  
To reduce neural network parameter counts and improve sound event detection performance, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for sound event detection  ...  INDEX TERMS Pattern recognition, sound event detection, multiscale learning, time-frequency transform, convolutional recurrent neural network.  ...  the CNN and RNN for rare sound event detection [11] . • R-CRNN: The R-CRNN is a Region-based CRNN to detect audio event and achieved the best performing single-model method among all methods without  ... 
doi:10.1109/access.2020.3015047 fatcat:hzox2myax5gapo4so5wthrcd4q

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection

Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang
2019 Interspeech 2019  
In this paper, we propose a temporal-frequential attention model for sound event detection (SED).  ...  With these two models, we attempt to make our system pay more attention to important frames or segments and important frequency components for sound event detection.  ...  [5] proposed a Region-based Convolutional Recurrent Neural Network (R-CRNN) to improve previous work in 2018.  ... 
doi:10.21437/interspeech.2019-2045 dblp:conf/interspeech/ShenHZ19 fatcat:weav3oilyrctlgwdb5b5xwppni

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection [article]

Yu-Han Shen, Ke-Xin He, Wei-Qiang Zhang
2018 arXiv   pre-print
In this paper, we propose a temporal-frequential attention model for sound event detection (SED).  ...  With these two models, we attempt to make our system pay more attention to important frames or segments and important frequency components for sound event detection.  ...  [5] proposed a Region-based Convolutional Recurrent Neural Network (R-CRNN) The corresponding author is Wei-Qiang Zhang. to improve previous work in 2018.  ... 
arXiv:1810.11939v1 fatcat:tga6n7b7rzfere3dskpthiq4gu

Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers [article]

Piper Wolters, Chris Daw, Brian Hutchinson, Lauren Phillips
2021 arXiv   pre-print
Motivated by a lack of suitable benchmark datasets for few-shot audio event detection, we generate and evaluate on two novel episodic rare sound event datasets: one using clips of celebrity speech as the  ...  In this paper, we propose novel approaches to few-shot sound event detection utilizing region proposals and the Perceiver architecture, which is capable of accurately localizing sound events with very  ...  The result is a sequence of per-timestep feature vectors, with time resolution downsampled by a factor of 8. 2) Region Proposal Network: Our RPN is based on R-CRNN, another region-based convolutional recurrent  ... 
arXiv:2107.13616v1 fatcat:qvp553hrerfm7clpi7ggrbjxlu

Multi-Scale Time-Frequency Attention for Acoustic Event Detection [article]

Jingyang Zhang, Wenhao Ding, Jintao Kang, Liang He
2019 arXiv   pre-print
Most attention-based methods only concentrate along the time axis, which is insufficient for Acoustic Event Detection (AED).  ...  We demonstrate the proposed method on Task 2 of Detection and Classification of Acoustic Scenes and Events (DCASE) 2017 Challenge.  ...  [7] , on the contrary, applied Region Proposal Network [8] to perform event-level detection and directly generate the onset/offset of target events.  ... 
arXiv:1904.00063v3 fatcat:dzq2n3keavhc7mpucvlzdqucya

Multi-Scale Time-Frequency Attention for Acoustic Event Detection

Jingyang Zhang, Wenhao Ding, Jintao Kang, Liang He
2019 Interspeech 2019  
Most attention-based methods only concentrate along the time axis, which is insufficient for Acoustic Event Detection (AED).  ...  We demonstrate the proposed method on Task 2 of Detection and Classification of Acoustic Scenes and Events (DCASE) 2017 Challenge.  ...  [7] , on the contrary, applied Region Proposal Network [8] to perform event-level detection and directly generate the onset/offset of target events.  ... 
doi:10.21437/interspeech.2019-1587 dblp:conf/interspeech/ZhangDKH19 fatcat:3sr7sk44azddvkejrpww3qznfi

Adaptive Multi-scale Detection of Acoustic Events [article]

Wenhao Ding, Liang He
2019 arXiv   pre-print
By taking advantage of the hourglass neural network and gated recurrent unit (GRU) module, our AdaMD produces multiple predictions at different temporal and frequency resolutions.  ...  The goal of acoustic (or sound) events detection (AED or SED) is to predict the temporal position of target events in given audio segments.  ...  In view of the excellent performance of convolutional recurrent neural network (CRNN) [12] in processing time sequences [13] , [14] , our high-level structure consists of a CNN and a RNN network.  ... 
arXiv:1911.06878v2 fatcat:mgrtw2sntjhmnojcdfbnrozcyu

An Initial Machine Learning-Based Victim's Scream Detection Analysis for Burning Sites

Fairuz Samiha Saeed, Abdullah Al Bashit, Vishu Viswanathan, Damian Valles
2021 Applied Sciences  
This research work presents an audio-based automated system for victim detection in fire emergencies, investigating two machine learning (ML) methods: support vector machines (SVM) and long short-term  ...  It is time-critical to detect the victims trapped inside the burning sites for facilitating the rescue operations.  ...  Data Availability Statement: The dataset samples used for this study are available at https://drive. google.com/drive/folders/1RweIZZ5T5FC2uG96kyXKlczlzVB463Ea (accessed on 9 August 2021).  ... 
doi:10.3390/app11188425 fatcat:modpqxd5zfditaerh6stl2kr3u