Filters








198 Hits in 4.4 sec

Combining Temporal And Spectral Features In Hmm-Based Drum Transcription

Jouni Paulus, Anssi Klapuri
2007 Zenodo  
Combining the Spectral and Temporal Features Combining the information from temporal features to the baseline HMM recogniser can be done in several ways.  ...  CONCLUSIONS AND FUTURE WORK We have proposed to utilise temporal features in conjunction with a HMM-based system for transcribing drums from polyphonic audio.  ... 
doi:10.5281/zenodo.1417257 fatcat:vw47jwqdgvh4ziwjcv6zazha44

Drum Sound Detection in Polyphonic Music with Hidden Markov Models

Jouni Paulus, Anssi Klapuri
2009 EURASIP Journal on Audio, Speech, and Music Processing  
The results suggest that the transcription is possible using connected HMMs, and that using detector-like models for each target drum provides a better performance than modelling drum combinations.  ...  Two ways of using HMMs are studied: modelling combinations of the target drums and a detector-like modelling of each target drum.  ...  In addition to the MFCCs and their temporal derivatives, other spectral features, such as band energy ratios, spectral kurtosis, skewness, flatness, and slope used, for example, in [6] were considered  ... 
doi:10.1186/1687-4722-2009-497292 fatcat:py3cmelufre3be5vxdjybzrotq

Drum Sound Detection in Polyphonic Music with Hidden Markov Models

Jouni Paulus, Anssi Klapuri
2009 EURASIP Journal on Audio, Speech, and Music Processing  
The results suggest that the transcription is possible using connected HMMs, and that using detector-like models for each target drum provides a better performance than modelling drum combinations.  ...  Two ways of using HMMs are studied: modelling combinations of the target drums and a detector-like modelling of each target drum.  ...  In addition to the MFCCs and their temporal derivatives, other spectral features, such as band energy ratios, spectral kurtosis, skewness, flatness, and slope used, for example, in [6] were considered  ... 
doi:10.1155/2009/497292 fatcat:liwbajpsdrh4nomkikppxc4kzq

Real-Time Transcription and Separation of Drum Recordings Based on NMF Decomposition

Christian Dittmar, Daniel Gärtner
2014 International Conference on Digital Audio Effects  
The method can be applied in music production and music education software. Performance results with respect to drum transcription are presented and discussed.  ...  This paper proposes a real-time capable method for transcribing and separating occurrences of single drum instruments in polyphonic drum recordings.  ...  In [4] , HMMs are used to model MFCCs and their temporal derivatives.  ... 
dblp:conf/dafx/DittmarG14 fatcat:4cyojihosbdojk66bt7d4asr44

Drum Loops Retrieval from Spoken Queries

Olivier Gillet, Ga�l Richard
2005 Journal of Intelligent Information Systems  
The transcription task necessary to index the database is performed using Hidden Markov Models (HMM) and Support Vector Machines (SVM) and achieves a 86.4% correct recognition rate.  ...  Recent efforts in audio indexing and music information retrieval mostly focus on melody.  ...  It is based on two major components: • An automatic transcription tool for drum loops: this tool consists in automatically indexing a drum loop by segmenting the audio in successive strokes and in recognizing  ... 
doi:10.1007/s10844-005-0321-9 fatcat:7zafm2gipjetjpmwqkaltut3ym

From Low-Level To Song-Level Percussion Descriptors Of Polyphonic Music

Martín Haro, Perfecto Herrera
2009 Zenodo  
Paulus and Klapuri [8] presented an evaluation using Hidden Markov Models with a combination of spectral features and temporal descriptors calculated from long narrow-band frames.  ...  Secondly, we evaluated the capacity of an automatic drum transcription system, based on object-level features and three binary SVM models, to transcribe percussion events in polyphonic music.  ... 
doi:10.5281/zenodo.1417201 fatcat:llmxcec435gnlebcqzzr2t3qei

Detecting and Describing Percussive Events in Polyphonic Music

Martin Haro, Perfecto Herrera
2008 Zenodo  
We observe an improvement in the classification results when object-level temporal evolution descriptors are added to the feature set.  ...  Then we evaluate the binary models within a whole drum transcription system achieving comparable results with state of the art algorithms.  ...  The most frequently selected features per instrument are: • Bass drum: low Bark bands, MFCC, spectral energy low and spectral ux. • Snare drum: mid Bark bands , temporal lpc, MFCC and spectral atness.  ... 
doi:10.5281/zenodo.3744770 fatcat:ygwpor72vzdelk5jelzmlb7j2y

Transcription and Separation of Drum Signals From Polyphonic Music

O. Gillet, G. Richard
2008 IEEE Transactions on Audio, Speech, and Language Processing  
A complete drum transcription system is described, which combines information from the original music signal and a drum track enhanced version obtained by source separation.  ...  The purpose of this article is to present new advances in music transcription and source separation with a focus on drum signals.  ...  This is the philosophy followed by NMF or ISA-based methods, in which the spectral and temporal profiles play the role of simpler intermediate representations for which the joint optimization is easy.  ... 
doi:10.1109/tasl.2007.914120 fatcat:gpg35mng45d5vepzegxtcmmonu

Time-constrained sequential pattern discovery for music genre classification

Jia-Min Ren, Jyh-Shing Roger Jang
2011 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
However, for a genre classification task, most of the text categorization based approaches only capture local temporal dependences (e.g. statistics of unigrams and bigrams).  ...  Music consists of both local and long-term temporal information.  ...  In contrast, Chen's method [6] utilizes hidden Markov model (HMM) based clustering technique to convert each song into a sequence of HMM states, and Reed and Lee [7] proposed the use of acoustic segment  ... 
doi:10.1109/icassp.2011.5946368 dblp:conf/icassp/RenJ11 fatcat:7wmx2jd37jhbrpf3ybait7nrkm

Real-Time Recognition of Percussive Sounds by a Model-Based Method

Umut Şimşekli, Antti Jylhä, Cumhur Erkut, A. Taylan Cemgil
2010 EURASIP Journal on Advances in Signal Processing  
In this paper, we introduce a model-based algorithm for detection of percussive events and test the algorithm on the detection and classification of different percussive sounds.  ...  We focus on tuning the algorithm for a good compromise between temporal precision, classification accuracy and low latency.  ...  and Technology.  ... 
doi:10.1155/2011/291860 fatcat:oyoxjfhefbbgdbtkj2umjsrdie

Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorization

Akshay Anantapadmanabhan, Ashwin Bellur, Hema A Murthy
2013 2013 IEEE International Conference on Acoustics, Speech and Signal Processing  
In this paper we use a Non-negative Matrix Factorization (NMF) based approach to analyze the strokes of the mridangam, a South Indian hand drum, in terms of the normal modes of the instrument.  ...  Using NMF, a dictionary of spectral basis vectors are first created for each of the modes of the mridangam.  ...  In section 4, the task of transcription using NMF along with HMMs is addressed. Section 5 details the results of the methods proposed for the task of transcription.  ... 
doi:10.1109/icassp.2013.6637633 dblp:conf/icassp/AnantapadmanabhanBM13 fatcat:dpv5ogmk5ffefjl4pxk2a345my

Four-way Classification of Tabla Strokes with Models Adapted from Automatic Drum Transcription

Rohit M A, Amitrajit Bhattacharjee, Preeti Rao
2021 Zenodo  
The acoustic correspondence between the tabla stroke categories and the common popular Western drum types motivates us to adapt models and methods from automatic drum transcription.  ...  To improve model robustness further, we investigate both drums and tabla-specific data augmentation strategies.  ...  Chordia [3] extended this work by targeting a larger, more diverse dataset, and using neural network (NN) and tree-based classifiers to categorize strokes based on spectral and temporal envelope features  ... 
doi:10.5281/zenodo.5624489 fatcat:c5kaz4tasvbhvmipvm466xvhgi

Analysis and automatic recognition of Human BeatBox sounds: A comparative study

Benjamin Picart, Sandrine Brognaux, Stephane Dupont
2015 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
Drum sounds typically use a notation based on plosives and fricatives, and instrument sounds cover vocalisations that go beyond spoken language vowels.  ...  In this paper, we investigate three complementary aspects of HBB analysis: pitch tracking, onset detection, and automatic recognition of sounds and instruments.  ...  and testing the HMM-based HBB recognizers.  ... 
doi:10.1109/icassp.2015.7178773 dblp:conf/icassp/PicartBD15 fatcat:itgcndusdvajvda24ehdfd56n4

An Efficient Temporally-Constrained Probabilistic Model For Multiple-Instrument Music Transcription

Emmanouil Benetos, Tillman Weyde
2015 Zenodo  
In the future, we will also incorporate temporal constraints on note transitions and polyphony level estimation and will continue work on instrument assignment by combining timbral features with PLCA-based  ...  Drum transcription can also be incorporated into the system, in the same way as in [5] .  ... 
doi:10.5281/zenodo.1418016 fatcat:s2bjqbfbubagxhxmki5ilweh7i

Segmentation And Recognition Of Tabla Strokes

Parag Chordia
2005 Zenodo  
ACKNOWLEDGEMENTS My sincere thanks to Olivier Gillet and Gael Richard for providing their data to me.  ...  A wide variety of spectral and temporal features were used, including spectral centroid, mel-frequency cepstral coefficients (mfccs), log-attack time and zero-crossing rate (zcr).  ...  The current work does not take into account the temporal evolution of features. This could be modeled with an hierarchical hmm.  ... 
doi:10.5281/zenodo.1416000 fatcat:tnpnv2wlbbceffonf5jzipnbau
« Previous Showing results 1 — 15 out of 198 results