Speaker Identity and Voice Quality: Modeling Human Responses and Automatic Speaker Recognition

Soo Jin Park, Caroline Sigouin, Jody Kreiman, Patricia Keating, Jinxi Guo, Gary Yeung, Fang-Yu Kuo, Abeer Alwan
<span title="2016-09-08">2016</span> <i title="ISCA"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/trpytsxgozamtbp7emuvz2ypra" style="color: black;">Interspeech 2016</a> </i> &nbsp;
Despite recent breakthroughs in automatic speaker recognition (ASpR), system performance still degrades when utterances are short and/or when within-speaker variability is large.  ...  This study used short test utterances (2-3sec) to investigate the effect of within-speaker variability on state-of-the-art ASpR system performance.  ...  Because automatic speaker recognition (ASpR) systems are sensitive to the utterance length of the enrollment and test data, it is important to balance the amount of data for a fair comparison.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.21437/interspeech.2016-523">doi:10.21437/interspeech.2016-523</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/interspeech/ParkSKKGYKA16.html">dblp:conf/interspeech/ParkSKKGYKA16</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/r5vovi6llzcildqgdonkuqpz5q">fatcat:r5vovi6llzcildqgdonkuqpz5q</a> </span>

Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition

Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali
<span title="2012-12-18">2012</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/xeix3cke4vgplj7qtbgmiixzka" style="color: black;">International Journal of Speech Technology</a> </i> &nbsp;
significant improvements over a state-of-the-art noise robust feature scheme, on both speech and speaker recognition tasks.  ...  However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions  ...  All statements of fact, opinion or conclusions contained herein are those of the authors and should not be construed as representing the official views or policies of IARPA, the ODNI, or the U.S.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10772-012-9184-y">doi:10.1007/s10772-012-9184-y</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/26412979">pmid:26412979</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC4579853/">pmcid:PMC4579853</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3pdgp2tmsrbyleuw34vylnu3ay">fatcat:3pdgp2tmsrbyleuw34vylnu3ay</a> </span>

Forensically inspired approaches to automatic speaker recognition

K. J. Han, M. K. Omar, J. Pelecanos, C. Pendus, S. Yaman, W. Zhu
<span title="">2011</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</a> </i> &nbsp;
This paper presents ongoing research leveraging forensic methods for automatic speaker recognition.  ...  Other approaches have also involved performing a phonetic analysis to recognize idiolectal attributes, and an implicit analysis of the demographics of speakers.  ...  We also overview the use of features other than standard cepstral coefficients for voice comparison.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2011.5947519">doi:10.1109/icassp.2011.5947519</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/HanOPPYZ11.html">dblp:conf/icassp/HanOPPYZ11</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/jiryxufhafbghpadppy7xdxl2u">fatcat:jiryxufhafbghpadppy7xdxl2u</a> </span>

A Paralinguistic Approach To Speaker Diarisation

Yue Zhang, Felix Weninger, Boqing Liu, Maximilian Schmitt, Florian Eyben, Björn Schuller
<span title="">2017</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/lahlxihmo5fhzpexw7rundu24u" style="color: black;">Proceedings of the 2017 ACM on Multimedia Conference - MM &#39;17</a> </i> &nbsp;
In this work, we present a new view on automatic speaker diarisation, i.e., assessing "who speaks when", based on the recognition of speaker traits such as age, gender, voice likability, and personality  ...  Our results provide clear evidence that using paralinguistic features for speaker diarisation is a promising avenue of research.  ...  The ComParE set is a well-evolved feature set for automatic recognition of paralinguistic speech phenomena, serving as a standard reference in the speech community.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3123266.3123338">doi:10.1145/3123266.3123338</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/mm/0014WLSES17.html">dblp:conf/mm/0014WLSES17</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/44wf5bpjofcxhpowgvzfcxcmwy">fatcat:44wf5bpjofcxhpowgvzfcxcmwy</a> </span>

Performance Analysis of Compressed-Domain Automatic Speaker Recognition as a Function of Speech Coding Technique and Bit Rate

M. Petracca, A. Servetti, J.c. Martin
<span title="">2006</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/pmefrmqsezb5zd3w7pf5k7fmxu" style="color: black;">2006 IEEE International Conference on Multimedia and Expo</a> </i> &nbsp;
Compressed-domain automatic speaker recognition is based on the analysis of the compressed parameters of speech coders.  ...  The objective is to perform low-complexity on-line speaker recognition for VoIP in the compressed domain, without the need to decode or resynthesize the speech bitstream.  ...  Table 2 shows a comparison of the results obtained by the speaker recognition algorithm for different speech formats.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icme.2006.262799">doi:10.1109/icme.2006.262799</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icmcs/PetraccaSM06.html">dblp:conf/icmcs/PetraccaSM06</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5xeknzjcavcg7cli4ew3txqgmi">fatcat:5xeknzjcavcg7cli4ew3txqgmi</a> </span>

Local Feature based Gender Independent Bangla ASR

Bulbul Ahamed, Khaled Mahmud, B.K.M. Mizanur, Foyzul Hassan, Rasel Ahmed, Mohammad Nurul
<span title="">2012</span> <i title="The Science and Information Organization"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/q5sqoxqjfvgmlps7hcahb76xhi" style="color: black;">International Journal of Advanced Research in Artificial Intelligence (IJARAI)</a> </i> &nbsp;
Speaker-specific characteristics play an important role in the performance of Bangla automatic speech recognition (ASR).  ...  This paper presents an automatic speech recognition (ASR) system for Bangla (widely known as Bengali) that suppresses speaker gender types based on local features extracted from input speech.  ...  Since the local features incorporate frequency and time domain information, it shows significant improvement of recognition performance over the method based on MFCCs at fewer mixture  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14569/ijarai.2012.010807">doi:10.14569/ijarai.2012.010807</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qeq2rs65rzdhvj4zlhprl5fj5a">fatcat:qeq2rs65rzdhvj4zlhprl5fj5a</a> </span>

Automatic intelligibility assessment of pathologic speech over the telephone

Tino Haderlein, Elmar Nöth, Anton Batliner, Ulrich Eysholdt, Frank Rosanowski
<span title="2011-08-30">2011</span> <i title="Informa UK Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fwtsn37skvadhils5ojlkaabsq" style="color: black;">Logopedics, Phoniatrics, Vocology</a> </i> &nbsp;
Objective evaluation was performed by Support Vector Regression on the word accuracy (WA) and word correctness (WR) of a speech recognition system, and a set of prosodic features.  ...  It consists of WR, the average duration of the silent pauses before a word, the standard deviation of the fundamental frequency on the entire sample, the standard deviation of jitter, and the ratio of  ...  Acknowledgments We would like to thank Maria Schuster, Eva Uhl, Florian Hebel, and the speech therapists of the Department of Phoniatrics and Pediatric Audiology for obtaining the audio and perceptual  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3109/14015439.2011.607470">doi:10.3109/14015439.2011.607470</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/21875389">pmid:21875389</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/okonbdir6ngang3tr6ep6lvxyy">fatcat:okonbdir6ngang3tr6ep6lvxyy</a> </span>

Towards an objective comparison of feature extraction techniques for automatic speaker recognition systems

Ayoub Bouziane, Jamal Kharroubi, Arsalane Zarghili
<span title="2021-02-01">2021</span> <i title="Institute of Advanced Engineering and Science"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/mwj5yys4mnek7al64kpw5psjqi" style="color: black;">Bulletin of Electrical Engineering and Informatics</a> </i> &nbsp;
The aim of the present paper is twofold. Firstly, it aims to review the most significant advancements in feature extraction techniques used for automatic speaker recognition.  ...  A common limitation of the previous comparative studies on speaker-features extraction techniques lies in the fact that the comparison is done independently of the used speaker modeling technique and its  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.11591/eei.v10i1.1782">doi:10.11591/eei.v10i1.1782</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/rt3khydcwvdhxlq4gxuhwqiywq">fatcat:rt3khydcwvdhxlq4gxuhwqiywq</a> </span>

Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings

Ashtosh Sapru, Herve Bourlard
<span title="">2013</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/n4dsimuzfzhvthzpnxzbyrmhim" style="color: black;">2013 Humaine Association Conference on Affective Computing and Intelligent Interaction</a> </i> &nbsp;
Experiments conducted on almost 12.5 hours of meeting data reveal that recognition system trained using language style features and acoustic features can reach a recognition accuracy of 64% and 68% respectively  ...  Language style features are extracted from automatically generated speech transcripts and characterize word usage in terms of psychologically meaningful categories.  ...  For the acoustic feature set, we performed a comparison study for the relevance of spectral, voice quality features against the standard feature set based on F0 and RMS energy.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/acii.2013.60">doi:10.1109/acii.2013.60</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/acii/SapruB13.html">dblp:conf/acii/SapruB13</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/izgblprherfjngb3f3ceeqoxaq">fatcat:izgblprherfjngb3f3ceeqoxaq</a> </span>

A Critical Review on Automatic Speaker Recognition

Nilu Singh
<span title="">2015</span> <i title="Science Publishing Group"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/g3unkpl2o5e25avghg6irio2bi" style="color: black;">Science Journal of Circuits Systems and Signal Processing</a> </i> &nbsp;
Automatic speaker recognition is a procedure for automatically recognizing who is speaking from the individual information contained in the speech signal.  ...  Automatic speaker recognition techniques make it possible to use a speaker's speech to verify their identity.  ...  Acknowledgement This work is sponsored by the CST-UP, Lucknow, India, under CST/D-413.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.11648/j.cssp.20150402.12">doi:10.11648/j.cssp.20150402.12</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qmwjhthdrfc3di4sedquuywywu">fatcat:qmwjhthdrfc3di4sedquuywywu</a> </span>

Leveraging inter-rater agreement for audio-visual emotion recognition

Yelin Kim, Emily Mower Provost
<span title="">2015</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/n4dsimuzfzhvthzpnxzbyrmhim" style="color: black;">2015 International Conference on Affective Computing and Intelligent Interaction (ACII)</a> </i> &nbsp;
We choose weights of prototypical and non-prototypical instances based on the maximal accuracy of each speaker.  ...  In this paper, we investigate how audiovisual emotion recognition systems can leverage prototypicality, the level of agreement or confusion among human evaluators.  ...  For instance, Eyben et al. have shown that multi-task learning of dimensional emotion labels and inter-rater standard deviation, improves the performance of dimensional emotion label regression tasks over  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/acii.2015.7344624">doi:10.1109/acii.2015.7344624</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/acii/KimP15.html">dblp:conf/acii/KimP15</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ef5dqdmaovakxplelcogfnvxpy">fatcat:ef5dqdmaovakxplelcogfnvxpy</a> </span>

Combining dynamic features with MFCC for text-independent speaker identification

Amol Chaudhari, Amol Rahulkar, S. B. Dhonde
<span title="">2015</span> <i title="IEEE"> 2015 International Conference on Information Processing (ICIP) </i> &nbsp;
We give an overview of both the classical and the state-of-the-art methods. We start with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling.  ...  This paper gives an overview of automatic speaker recognition technology, with an emphasis on text-independent recognition. Speaker recognition has been studied actively for several decades.  ...  We use perceptual tests performed by non-experts and compare their performance with that of a baseline automatic speaker recognition system.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/infop.2015.7489370">doi:10.1109/infop.2015.7489370</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/w2uqfyafm5a6netwm3dk7bt2p4">fatcat:w2uqfyafm5a6netwm3dk7bt2p4</a> </span>

Mispronunciation detection based on cross-language phonological comparisons

Lan Wang, Xin Feng, Helen M. Meng
<span title="">2008</span> <i title="IEEE"> 2008 International Conference on Audio, Language and Image Processing </i> &nbsp;
The experiments show that the agreement between automatic mispronunciation detection and human judges is over 84% for 21 Cantonese speakers.  ...  This paper presents a method using speech recognition with linguistic constraints to detect the mispronunciations made by Cantonese learners of English.  ...  However, standard phone recognition systems yield a much higher phone error rate even for native speakers.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icalip.2008.4590074">doi:10.1109/icalip.2008.4590074</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/irjopou4gbcnrorvissnx3rjqq">fatcat:irjopou4gbcnrorvissnx3rjqq</a> </span>

Speech Recognition Using Matrix Comparison

Vishnupriya Gupta
<span title="">2012</span> <i title="IOSR Journals"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/7ftgkqdzrnfspfyjn3usmdd66q" style="color: black;">IOSR Journal of VLSI and Signal processing</a> </i> &nbsp;
Many speech/voice processing tasks, like speech and word recognition, reached satisfactory performance levels on specific applications, and although a variety of commercial products were launched in the  ...  Recent technological advances have made recognition of more complex speech patterns possible. [1] Speech/voice recognition is a very difficult task to be performed by a computer system.  ...  , so that it is arguably the most important component of designing an intelligent system based on speech/speaker recognition, since the best classifier will perform poorly if the features are not chosen  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.9790/4200-0114345">doi:10.9790/4200-0114345</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/h2zrqpepjbaofedwoxutdxgl2m">fatcat:h2zrqpepjbaofedwoxutdxgl2m</a> </span>

Automatic Speech Recognition Systems for the Evaluation of Voice and Speech Disorders in Head and Neck Cancer

Andreas Maier, Tino Haderlein, Florian Stelzle, Elmar Nöth, Emeka Nkenke, Frank Rosanowski, Anne Schützenberger, Maria Schuster
<span title="">2010</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tzakietxejgppjzsrojed7bkke" style="color: black;">EURASIP Journal on Audio, Speech, and Music Processing</a> </i> &nbsp;
Intelligibility was quantified by speech recognition on recordings of a standard text read by 41 German laryngectomized patients with cancer of the larynx or hypopharynx and 49 German patients who had  ...  The speech recognition provides the percentage of correctly recognized words of a sequence, that is, the word recognition rate.  ...  The authors are responsible for the content of this article.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2010/926951">doi:10.1155/2010/926951</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pfdqthpdujduzc34gtg2ra3rua">fatcat:pfdqthpdujduzc34gtg2ra3rua</a> </span>