Automatic Dialect Identification in Ao, a Low Resource Language

Tzudir, Moakala

Automatic Dialect Identification in Ao, a Low Resource Language

dc.contributor.author	Tzudir, Moakala
dc.date.accessioned	2024-08-01T09:47:12Z
dc.date.available	2024-08-01T09:47:12Z
dc.date.issued	2023
dc.description	Prasanna, S R Mahadeva	en_US
dc.description	Sarmah, Priyankoo
dc.description.abstract	Dialect Identification (DID) is a significant research problem widely explored in major languages like Arabic, Chinese, and Spanish. DID can serve as a frontend for many applications like Automatic Speech Recognition (ASR) that may require special dialect-specific enhancements for improved performance. This thesis proposes an automatic DID system for Ao, an under-resourced language of India. Ao is a Tibeto-Burman language spoken in Nagaland. It is a tonal language with three lexical tones: high, mid, and low. Chungli, Mongsen, and Changki are the three dialects of Ao that differ in their respective tone assignment on lexical words. Four principal contributions are made in this thesis. The first contribution of this thesis is creating a manually collected and annotated novel speech dataset to foster research on the Ao language. The second contribution of the thesis is a detailed acoustic study of the unexplored tone dynamics of the dialects of Ao. Based on the analysis, a tonal feature ($F_0$) to capture the dialect-specific tone information is proposed. The DID performance improves when the proposed tonal feature is combined with other spectral features. As the third contribution, this thesis explores three excitation source features in the DID task. The source features studied are Residual Mel Frequency Cepstral Coefficient (RMFCC), Integrated Linear Prediction Residual Log Mel Spectrogram (ILPR-LMS), and Linear Prediction (LP)-gammatonegram. A notable performance improvement is observed when the source information is combined with the vocal tract information. The fourth contribution of this thesis is the exploration of prosody-related characteristics of speech signals. The prosodic features are observed to provide significant performance improvements in classifying the dialects of Ao. The thesis work is concluded by combining all the proposed approaches to build an efficient DID system for Ao. Among many hurdles in studying under-resourced languages like Ao, the need for more data is the most prominent. Nevertheless, the contributions of this thesis may bridge some of those gaps and spur future research in this direction.	en_US
dc.identifier.other	ROLL NO.156102018
dc.identifier.uri	https://gyan.iitg.ac.in/handle/123456789/2682
dc.language.iso	en	en_US
dc.relation.ispartofseries	TH-2975;
dc.subject	Ao	en_US
dc.subject	Tonal Language	en_US
dc.subject	Low-resource Language	en_US
dc.subject	Dialect Identification	en_US
dc.subject	Tonal Features	en_US
dc.subject	Excitation Source Features	en_US
dc.subject	Prosodic Features	en_US
dc.title	Automatic Dialect Identification in Ao, a Low Resource Language	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: Abstract-TH-2975_156102018.pdf
Size:: 98.08 KB
Format:: Adobe Portable Document Format
Description:: ABSTRACT

Download

Name:: TH-2975_156102018.pdf
Size:: 9.55 MB
Format:: Adobe Portable Document Format
Description:: THESIS

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

PhD Theses (Electronics and Electrical Engineering)