
A Generative Product-of-Filters Model of Audio [article]

Dawen Liang, Matthew D. Hoffman, Gautham J. Mysore
2014 arXiv   pre-print
We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain.  ...  We demonstrate PoF's potential for audio processing on a bandwidth expansion task, and show that PoF can serve as an effective unsupervised feature extractor for a speaker identification task.  ...  Product-of-Filters Model We are interested in modeling audio spectrograms, which are collections of Fourier magnitude spectra W taken from some set of audio signals, where W is an F × T non-negative matrix  ... 
arXiv:1312.5857v5 fatcat:4s6zzfnu7vgtllgtddsriyc734

Modeling the Sound of Astronaut Voice Communications

Aaron Geldert, Tom Railio
2021 Zenodo  
The model presented here is composed of four stages: pre-filtering, nonlinear processing, noise generation, and post-filtering.  ...  This vintage audio effect is relevant to sound design, post-production, and a range of other artistic uses.  ...  The emulation of many types of vintage sounds is often desirable in audio production work.  ... 
doi:10.5281/zenodo.5724061 fatcat:n37sws5yzfanbisq4ftvlluf5i

Samples Homogenization For Interactive Soundscapes

Jorge Garcia
2011 Zenodo  
Then, a study of the transformations that can be applied using this model is presented as equalization methods.  ...  This thesis presents the challenges and current state of the art related to soundscapes modeling and design.  ...  By the date of March 2011 they presented a new product to generate audio in real time dependent on gestures coming from game controllers (like the WiiMote), AudioGesture.  ... 
doi:10.5281/zenodo.1164267 fatcat:3ixfe3zpvjervasp4bbvkc77ju


Saurabh R Prasad, Pawan K. Gaikwad, Yashwant V Joshi
2020 Zenodo  
Digital audio effects refer to all those algorithms that are used for enhancing sound in any of the steps of a processing chain of music production.  ...  Real time audio effects generation is a highly challenging task in the field of signal processing.  ...  The Simulink model is designed without any prior calculation of model parameters thus the model design is supported by trial and error methodology [7] [8]. 2. THEORY OF GENERATED AUDIO EFFECTS WITH  ... 
doi:10.5281/zenodo.3987949 fatcat:v3j57uj4m5b73eel3uctmcbtte

A Survey of Tensor Factorization Frameworks on Audio Modelling

Unsal Gokdag
2014 International Journal of Applied Mathematics Electronics and Computers  
This survey is about Tensor Factorization methods for audio modeling, which focuses on probabilistic latent tensor factorization and generalized coupled tensor factorization by expectation maximization  ...  The model resembles physically inspired source filter models of audio production in spectral domain by multiplying harmonic excitation with spectral envelope of a body response filter.  ...  A tensor factorization (TF) model is the product of a set of tensors for which defined on the corresponding index set marginalized over the set of indices .  ... 
doi:10.18100/ijamec.70262 fatcat:5ooutd6x3rbzvn6adtwupx76zy

Speech Synthesis and Control Using Differentiable DSP [article]

Giorgio Fabbro, Vladimir Golkov, Thomas Kemp, Daniel Cremers
2020 arXiv   pre-print
We propose a new neural vocoder that offers control of such factors of variation.  ...  In this work we move towards a speech synthesis system that can produce diverse speech renditions of a text by allowing (but not requiring) explicit control over the various factors of variation.  ...  Note that the usage of an oscillator to model the voiced parts of speech (vowels) and a filtered-noise generator to model consonant sounds makes our model similar to spectral modeling synthesis [25]  ... 
arXiv:2010.15084v1 fatcat:zljruy4eirazhbc6saj5cupjfu

Rating Algorithm for Pronunciation of English Based on Audio Feature Pattern Matching

Kun Li, Jing Li, Yufang Song, Hewei Fu, J.Y. Li, T.Y. Liu, T. Deng, M. Tian
2015 MATEC Web of Conferences  
After processing, the original audio sequence can generate a new audio signal feature.  ...  Weiqian Liang has established a pronunciation quality evaluating system model of English learning system based on the combination of pronunciation quality, pronouncing network generation, evaluating model  ...  The function expressions of filter are as follows:  ... 
doi:10.1051/matecconf/20152201032 fatcat:njqrl5vbfzbwfj2owzzmc2o23e

Page 259 of SMPTE Motion Imaging Journal Vol. 100, Issue 4 [page]

1991 SMPTE Motion Imaging Journal  
TP-3) to its line of digital and audio switching products. The product provides high-quality interface between analog audio and the AES/EBU digital audio standard.  ...  Two new products were introduced by Dolby Laboratories. Model DP501/DP502 digital audio coding systems (Fig. TP-4) provide professional-quality audio at 128 kbits/sec/channel.  ... 

Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans

Erik Bresch, Jon Nielsen, Krishna Nayak, Shrikanth Narayanan
2006 Journal of the Acoustical Society of America  
The audio setup itself features two fiber optical microphones and a noise-canceling filter.  ...  Two noise cancellation methods are described including a novel approach using a pulse sequence specific model of the gradient noise of the MRI scanner.  ...  In the first postprocessing step, low-pass filtering and decimation of the audio data to a sampling frequency of 20 kHz is carried out.  ... 
doi:10.1121/1.2335423 pmid:17069275 pmcid:PMC1800830 fatcat:cmttma6f2rfrzlv2fpjseuyw7u

D3.2: Implementation And Documentation Of Reverberation For Object-Based Audio Broadcasting

Markus Noisternig, Thibaut Carpentier, Matthias Geier, Olivier Warusfel
2016 Zenodo  
This document discusses methods for the representation of reverberation in object-based broadcast and also partly describes an example implementation.  ...  However, geometrical room acoustic models are rarely implemented in audio post-production tools for broadcast. 2.1.3 Perceptually-motivated simulation model Figure 5 depicts a generic simplified space-time-frequency  ...  device to device); consequently, the sound engineer loses control over a critical part of audio production.  ... 
doi:10.5281/zenodo.844011 fatcat:plvpf2hp6vfa7dtw2t6z3mtnjy

Modeling of nonlinear audio effects with end-to-end deep neural networks [article]

Marco A. Martínez Ramirez, Joshua D. Reiss
2019 arXiv   pre-print
In this work, we investigate deep learning architectures for audio processing and we aim to find a general purpose end-to-end deep neural network to perform modeling of nonlinear audio effects.  ...  In the context of music production, distortion effects are mainly used for aesthetic reasons and are usually applied to electric musical instruments.  ...  CONCLUSION In this work, we introduced a general purpose deep learning architecture for audio processing in the context of nonlinear modeling.  ... 
arXiv:1810.06603v2 fatcat:vamhnq7lsvgslmrse33tmnsuoy

A masking-threshold-adapted weighting filter for excitation search

Wen-Whei Chang, Chin-Tun Wang
1996 IEEE Transactions on Speech and Audio Processing  
Simulation results indicate that the combined use of a multisinusoid excitation model and a masking-threshold-adapted weighting filter allows the implementation of an LPC-based audio coder that delivers  ...  In this paper, we report on new approaches to exploiting the masking threshold in the design of a perceptual noise-weighting filter for excitation searches.  ...  Snyder, for their careful readings of this paper and their constructive suggestions. They also acknowledge Li-Wei Wang for carrying out the multisinusoid excitation model experiments.  ... 
doi:10.1109/89.486062 fatcat:czvzjbudlrav7ou3l4szgg7wta

A Noninvasive Brain-Computer Interface for Real-Time Speech Synthesis: The Importance of Multimodal Feedback

Jonathan S. Brumberg, Kevin M. Pitt, Jeremy D. Burnison
2018 IEEE transactions on neural systems and rehabilitation engineering  
Over a three-session training period, sixteen participants learned to control the BCI for production of three vowel sounds (/i/ [heed], /ɑ/ [hot], and /u/ [who'd]) and were split  ...  Audio feedback was provided by a formant frequency artificial speech synthesizer, and visual feedback was given as a 2-D cursor on a graphical representation of the plane defined by the first two formant  ...  BCI performance 1) Accuracy and endpoint distance: A generalized linear mixed model, with a logit link function, was used to assess the effects of feedback type, runs, and sessions on BCI production  ... 
doi:10.1109/tnsre.2018.2808425 pmid:29641392 pmcid:PMC5906041 fatcat:hj7ek3xr5jdmtjnoxyifcum6ae

Audio Metaphor 2.0: An Improved Classification and Segmentation Pipeline for Generative Sound Design Systems

Joshua Kranabetter, Craig Carpenter, Renaud Bougueng Tchemeube, Philippe Pasquier, Miles Thorogood
2022 Zenodo  
We present a new set of classification and segmentation algorithms as part of Audio Metaphor (AUME), a generative system for creating novel soundscape compositions.  ...  Building off previous work, we implemented a new audio feature extractor and conducted experiments to test the accuracy of the updated system.  ...  Acknowledgments We would like to acknowledge the National Science and Engineering Research Council of Canada, and the Social Sciences and Humanities Research Council of Canada for their ongoing financial  ... 
doi:10.5281/zenodo.6573410 fatcat:dm5qqvy4fzdb7a5tazzx33vu44

An Electromagnetic Lock Actuated by a Mobile Phone Equipped with a Self-Made Laser Pointer

Jau-Woei Perng, Tung-Li Hsieh
2019 Electronics  
The laser pointer (wavelength of 630–650 nm and maximum output of 5 mW) lights up when the smart phone's music starts playing at a music frequency matching the light frequency.  ...  The main purpose of this study was to create an acousto-optic control lock device to convert electrical signals with a specific sound command using an acousto-optic conversion module, thereby improving  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/electronics8121524 fatcat:q336e75t2fcjhg7ovitclysyvq