Detecting voice disguise from speech variability: Analysis of three glottal and vocal tract measures

Talal B. Amin, James S. German, Pina Marziliano
2013 Journal of the Acoustical Society of America  
Corresponding author's address: School of Electrical and Electronic Engineering, Nanyang Technological University, Media Technology Lab, S2.2-B4-02, Singapore 649491, Singapore, talal1@e.ntu.edu.sg

The deliberate attempt by speakers to conceal their identity (voice disguise) presents a challenge for forensics and for automated speaker identification systems. Using a database of natural and disguised voices of three professional voice artists, we build on earlier findings by exploring how certain glottal and vocal tract measures, such as glottal timing (Open Quotient) and vowel formants, are manipulated by the artists to create novel voice identities. We also investigate whether any features derived from these measures are useful for discriminating natural from disguised voices. As expected, variation in Open Quotient was speaker-dependent, and corresponded closely to social attributes (i.e., age) of the voice identities involved. By modelling the overall variability of speakers in the vowel space, we propose a new method for automatically classifying natural and disguised voices. The proposed method is found to outperform several state-of-the-art methods.
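To make the two kinds of measure concrete, here is a minimal Python sketch. It assumes the common definition of Open Quotient as the fraction of a glottal cycle during which the glottis is open. As one simple stand-in for "variability in the vowel space", it uses the mean distance of (F1, F2) vowel tokens from their centroid. The abstract does not specify the paper's actual features or classifier, so the function names and sample values below are hypothetical and are not the authors' method.

```python
from math import hypot
from statistics import mean


def open_quotient(open_duration_s, period_s):
    """Fraction of one glottal cycle during which the glottis is open."""
    if period_s <= 0 or not 0 <= open_duration_s <= period_s:
        raise ValueError("need period_s > 0 and 0 <= open_duration_s <= period_s")
    return open_duration_s / period_s


def vowel_space_dispersion(formants_hz):
    """Mean Euclidean distance of (F1, F2) tokens from their centroid, in Hz.

    This is an illustrative variability measure, assumed for this sketch only.
    """
    f1_center = mean(f1 for f1, _ in formants_hz)
    f2_center = mean(f2 for _, f2 in formants_hz)
    return mean(hypot(f1 - f1_center, f2 - f2_center) for f1, f2 in formants_hz)


# Hypothetical values for illustration only, not data from the study.
oq = open_quotient(open_duration_s=0.004, period_s=0.008)
tokens = [(300.0, 2300.0), (700.0, 1200.0), (300.0, 900.0), (700.0, 1800.0)]
dispersion = vowel_space_dispersion(tokens)
```

A classifier along these lines would compare such per-speaker statistics between natural and disguised recordings. Which statistics are actually discriminative is an empirical question that the sketch does not answer.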
doi:10.1121/1.4830853 fatcat:ej7hzf6atrbvzdsksvepbojf7m