Speaker Verification Under Degraded Conditions Using Vowel-Like and Nonvowel-Like Regions

Pradhan, Gayadhar

Speaker Verification Under Degraded Conditions Using Vowel-Like and Nonvowel-Like Regions

dc.contributor.author	Pradhan, Gayadhar
dc.date.accessioned	2015-09-21T07:12:46Z
dc.date.accessioned	2023-10-20T07:28:03Z
dc.date.available	2015-09-21T07:12:46Z
dc.date.available	2023-10-20T07:28:03Z
dc.date.issued	2013
dc.description	Prasanna, S R Mahadeva	en_US
dc.description.abstract	This thesis proposes a speaker verification system by independent processing of vowel-like regions (VLRs) and non-vowel-like regions (non-VLRs) for achieving better SV perfor- mance under clean and degraded conditions. VLRs are defined as the speech regions belonging to vowels, diphthongs and semivowels, and rest of the consonants as non-VLRs. Methods are proposed for detecting VLRs and non-VLRs using excitation source informa- tion. The VLR onset point (VLROPs) and end points (VLREPs) are hypothesized and used in an iterative algorithm for detecting the VLRs. Next, for detection of non-VLRs, the linear prediction (LP) residual samples in the VLRs are attenuated significantly to indirectly emphasize the residual samples in the non-VLRs. The modified LP residual samples excite the time varying all pole filter to reconstruct non-VLRs enhanced speech and used for detecting non-VLRs. For any practical application of a text-independent speaker verification (SV) system, along with phonetic variability, the speech signal may be affected by background noise, sensor mismatch and channel mismatch. To reduce the effect of these variabilities, three different methods are proposed for processing the VLRs and non-VLRs during training and testing of a SV system. First, a SV system is developed by using only the VLRs to demonstrate the significance of the VLRs for SV under degraded conditions. Then, the VLRs and non- VLRs are used independently during training and testing of a SV system, and the scores are combined with higher weight on VLRs, for a better SV system under clean and degraded conditions. Finally, a SV system is developed by implicit modeling of VLRs and non-VLRs information to reduce the computational complexity involved in the explicit segmentation of these regions. The experimental results presented in this thesis work shows that the VLRs are more speaker specific and relatively less affected under degraded conditions. A better SV system can be developed under clean and degraded conditions by independent processing of VLRs and non-VLRs with emphasis on the VLRs.	en_US
dc.identifier.other	ROLL NO.09610214
dc.identifier.uri	https://gyan.iitg.ac.in/handle/123456789/424
dc.language.iso	en	en_US
dc.relation.ispartofseries	TH-1196;
dc.subject	ELECTRONICS AND ELECTRICAL ENGINEERING	en_US
dc.title	Speaker Verification Under Degraded Conditions Using Vowel-Like and Nonvowel-Like Regions	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: TH-1196_09610214.pdf
Size:: 10.09 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

PhD Theses (Electronics and Electrical Engineering)