Filters








491 Hits in 8.2 sec

Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates

Selma Özaydın, Buyurman Baykal
2003 Speech Communication  
A matrix quantization scheme and a very low bit rate vocoder is developed to obtain good quality speech for low capacity communication links.  ...  For the MQME vocoder, listening tests have proven that an efficient and high quality coding has been achieved at a bit rate of 1200 bps.  ...  Acknowledgements The authors would like to thank the reviewers for their constructive criticism and helpful suggestions which helped to improve the presentation of this paper.  ... 
doi:10.1016/s0167-6393(03)00009-8 fatcat:srvujfa2zbhcfjsj3wz3q3madm

Modeling of Phoneme Transitions for Natural Synthesis of Speech Signals

H. M., J. V.
2018 International Journal of Computer Applications  
This study basically focuses on introducing a novel method with low bit rate to improve the naturalness of synthetic speech.  ...  They are Fast Fourier Transform (FFT) algorithm, Auto Regressive model (AR) with Linear Predictive Coding (LPC) algorithm and Auto Regressive Moving Average model (ARMA) with Steiglitz-McBride method.  ...  This is a low bit rate technique which is more useful for most of the speech synthesis systems, because of utilization of lesser number of parameters.  ... 
doi:10.5120/ijca2018918008 fatcat:32saqjezgnd7zlowjb7uzlffqi

New Nonuniform Transmission and ADPCM Coding System for Improving Both Signal-to-Noise Ratio and Bit Rate

E Lahalle, G Fleury, R Zgheib
2011 IEEE Signal Processing Letters  
The proposed system is demonstrated for audio-signal compression and compared to the ADPCM G.726 standard. The new system yields improvements in both signal-to-noise ratio and average bit rate.  ...  We propose a new adaptive sampling (nonuniform transmission) method combined with the adaptive reconstruction algorithm. A new NUT-ADPCM coding-decoding system is designed.  ...  We consider the average bit rate, given by r = (T B +1)F s , where T stands for the proportion of samples transmitted. For comparison purposes, Table I shows the results of the three methods.  ... 
doi:10.1109/lsp.2010.2102016 fatcat:kp2dqlfkpzcmxfoupxwovlmmdu

Quantization Of The Lpc Model Vv1Th The Reconstruction Error Distortion Measure

P.M.T. Broersen, J.S. Erkelens
1996 Zenodo  
Publication in the conference proceedings of EUSIPCO, Trieste, Italy, 1996  ...  CONCLUSIONS A new distortion measure, the Reconstruction Error Distortion measure, is proposed for the purpose of quantization of the LPC models in low bit rate speech coders.  ...  INTRODUCTION Accurate quantization of LPC models is very important for the quality of low bitrate speech coders.  ... 
doi:10.5281/zenodo.36337 fatcat:r7rh3rstmzcadlwcbgvdx6wi3a

Preface: Special Section: Advances in Speech, Music and Audio Signal processing (Articles 1–13)

K. C. Santosh, Surekha Borra, Amit Joshi, Nilanjan Dey
2019 International Journal of Speech Technology  
In the fourth article, Mohan et al. presented a low bit-rate speech coding method based on multicomponent amplitude and frequency modulated signal model, the Fourier-Bessel series expansion and the discrete  ...  The symmetric Itakura-Saito and the root-mean-square logspectral distance measures are used for comparison of the original and reconstructed speech signals.  ...  In the fourth article, Mohan et al. presented a low bit-rate speech coding method based on multicomponent amplitude and frequency modulated signal model, the Fourier-Bessel series expansion and the discrete  ... 
doi:10.1007/s10772-019-09606-9 fatcat:hq2vwvgoojeehmyozauohgisxe

Adaptive multiresolution decomposition: application to lossless image compression

Bekkouche, Barret
2002 IIEEE International Conference on Acoustics Speech and Signal Processing  
The proposed scheme gives, on average, smaller lossless compression bit rate. However, This improvement in performance is achieved at the expense of an increase in computational complexity.  ...  In this paper we introduce the use of adaptive filter banks in lossless compression of images with progressive coding in resolution.  ...  It can be clearly seen that the use of an ARMA estimation model (r P T a H ) gives smaller compression bit rate than an AR model (r P a H).  ... 
doi:10.1109/icassp.2002.1004675 fatcat:4eu4tbjfi5gzlmjdyqeryddeam

Adaptive multiresolution decomposition: Application to lossless image compression

Hocine Bekkouche, Michel Barret
2002 IEEE International Conference on Acoustics Speech and Signal Processing  
The proposed scheme gives, on average, smaller lossless compression bit rate. However, This improvement in performance is achieved at the expense of an increase in computational complexity.  ...  In this paper we introduce the use of adaptive filter banks in lossless compression of images with progressive coding in resolution.  ...  It can be clearly seen that the use of an ARMA estimation model (r P T a H ) gives smaller compression bit rate than an AR model (r P a H).  ... 
doi:10.1109/icassp.2002.5745417 dblp:conf/icassp/BekkoucheB02 fatcat:yrtsmgexmvexvlcajmmet7tvp4

Multispectral code excited linear prediction coding and its application in magnetic resonance images

Jian-Hong Hu, Yao Wang, P.T. Cahill
1997 IEEE Transactions on Image Processing  
This paper reports a multispectral code excited linear prediction (MCELP) method for the compression of multispectral images.  ...  wavelet (EZW) coding method, and the vector tree (VT) coding method, as well as the multispectral segmented autoregressive moving average (MSARMA) method we developed previously.  ...  signal, such as Gaussian noise which is used in low bit rate speech coding [5] , [6] .  ... 
doi:10.1109/83.641415 pmid:18282913 fatcat:fhtzqahotbc4jfwzs4m63kueoe

Non-linear Prediction of Speech Signal Using Artificial Neural Nets [chapter]

K. Ashouri, M. Amini, M. H. Savoji
2002 Lecture Notes in Computer Science  
Therefore, considering this non-linearity should lead to lower signal dynamics during coding with a consequent reduction in bit-rate and the needed bandwidth.  ...  Prediction of speech signal has applications in speech technology, especially in coding. Conventionally linear prediction is used. However, non-linear phenomena exist in speech production.  ...  Linear prediction is used conventionally to reduce the redundancy of speech signal and decrease the bit-rate in coding.  ... 
doi:10.1007/3-540-36087-5_25 fatcat:d5vcju6o5jhznncz6ln645qrci

Re-estimation of linear predictive parameters in sparse linear prediction

Daniele Giacobello, Manohar N. Murthi, Mads Graesboll Christensen, Soren Holdt Jensen, Marc Moonen
2009 2009 Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers  
This approach defines predictors that look for a sparse residual rather than a minimum variance one with direct applications to coding but also consistent with the speech production model of voiced speech  ...  frame independent coding for speech communications over packet networks. i ii List of Papers The main body of this thesis consists of the following papers: [A] D.  ...  to low bit-rate speech coding.  ... 
doi:10.1109/acssc.2009.5470202 fatcat:qu3dseg5w5fahdyl62ic6ypyxq

ARMA companding scheme with improved symbol error rate for PAPR reduction in OFDM systems

Yasir Rahmatallah, Nidhal Bouaynaya, Seshadri Mohan
2010 2010 Wireless Telecommunications Symposium (WTS)  
The proposed system estimates a few autoregressive moving average (ARMA) model parameters of the difference signal between the companded and uncompanded OFDM envelopes and passes these parameters to the  ...  Upon receiving the ARMA model parameters, the receiver regenerates the difference signal and then adds it to the received companded OFDM envelope to recover the uncompanded OFDM signal.  ...  ACKNOWLEDGMENT The authors would like to thank the National Science Foundation for partly supporting this research through the NSF Grant: EPS-0701890.  ... 
doi:10.1109/wts.2010.5479631 dblp:conf/wts/RahmatallahBM10 fatcat:d2ulkvpjine2xpidau2exu4mli

On perceptual distortion minimization and nonlinear least-squares frequency estimation

M.G. Christensen, S.H. Jensen
2006 IEEE Transactions on Audio, Speech, and Language Processing  
The topic of this thesis is parametric coding of speech and audio. A number of estimation and modeling problems in this field of research are addressed.  ...  Based on rate-distortion optimization, an optimal segmentation and allocation of bits can be found, but this requires that distortions are calculated for all allocations and segments.  ...  Speech coders can code speech very efficiently at very low bit-rates but they do not perform well for music. Audio coders can code both music and speech well, but at higher bit-rates.  ... 
doi:10.1109/tsa.2005.860347 fatcat:ofqzufshdza5fbpwe6hoc6tbya

Biologically-Inspired Spike-Based Automatic Speech Recognition of Isolated Digits Over a Reproducing Kernel Hilbert Space

Kan Li, José C. Príncipe
2018 Frontiers in Neuroscience  
As a proof of concept, we demonstrate its capabilities using the benchmark TI-46 digit corpus for isolated-word automatic speech recognition (ASR) or keyword spotting.  ...  For spike-train front-end, spike-KAARMA also outperformed state-of-the-art SNN solutions.  ...  Harris for his helpful discussions during the research. We are also thankful to the editor and reviewers for their valuable comments and suggestions that improved the manuscript.  ... 
doi:10.3389/fnins.2018.00194 pmid:29666568 pmcid:PMC5891646 fatcat:weji6gclmzbrjl5frmbipfmjwy

Linear prediction of the one-sided autocorrelation sequence for noisy speech recognition

J. Hernando, C. Nadeu
1997 IEEE Transactions on Speech and Audio Processing  
The aim of this correspondence is to present a robust representation of speech based on AR modeling of the causal part of the autocorrelation sequence.  ...  In noisy speech recognition, this new representation achieves better results than several other related techniques.  ...  low.  ... 
doi:10.1109/89.554273 fatcat:da4ze7m67ved5aovnql5chbmla

ARMAS: Active Reconstruction of Missing Audio Segments [article]

Zohra Cheddad, Abbas Cheddad
2022 arXiv   pre-print
Nevertheless, prior traditional methods with linear interpolation, phase coding and tone insertion techniques are still in vogue.  ...  for audio inpainting) steganography provides.  ...  ACKNOWLEDGMENT We thank the reviewers for their neutrality in assessing this manuscript and for the constructive feedback.  ... 
arXiv:2111.10891v3 fatcat:7qvzsrzmozgzll3tqsm4pkdhri
« Previous Showing results 1 — 15 out of 491 results