Spectral Analysis of Stressed Speech for Speech Recognition

dc.contributor.author: Shukla, Sumitra
dc.date.accessioned: 2015-09-23T07:08:57Z
dc.date.accessioned: 2023-10-20T07:28:07Z
dc.date.available: 2015-09-23T07:08:57Z
dc.date.available: 2023-10-20T07:28:07Z
dc.date.issued: 2014
dc.description: Supervisor: S. Dandapat and S.R. Mahadeva Prasanna [en_US]
dc.description.abstract: The objective of this thesis is to analyze the stress information in the spectral features of stressed speech. The analysis of stress is carried out in the frequency domain, with specific emphasis on several representations of this structure: spectrum, subband, and cepstrum. The investigation of stress information includes recognition of speech under stressed conditions. This thesis addresses four problems of stressed speech recognition. The first problem deals with the development and evaluation of a stressed speech database. The database is validated by evaluating the stress class and the speech information present in the utterances, both perceptually and with automatic methods for stress classification and speech recognition, respectively. Under stressed conditions, spectral energy migrates from lower to higher frequencies. As reported in the literature, this migration affects the spectral tilt and the subband energy of the speech signal. Compared to the source, the formants are less affected by stress. The second problem revisits this observation. The conventional method for computing spectral tilt captures the gross spectral energy information of the speech signal. In the present work, relative formant peak displacement (RFD) is proposed to quantify this variation in formant peaks. The RFD values of the second, third, and fourth formant peaks are computed as the relative displacements of these peaks from the first formant peak. A stress classifier is developed to investigate the stress information in the RFD feature. [en_US]
dc.identifier.other: ROLL NO. 07610203
dc.identifier.uri: https://gyan.iitg.ac.in/handle/123456789/535
dc.language.iso: en [en_US]
dc.relation.ispartofseries: TH-1325;
dc.subject: ELECTRONICS AND ELECTRICAL ENGINEERING [en_US]
dc.title: Spectral Analysis of Stressed Speech for Speech Recognition [en_US]
dc.type: Thesis [en_US]
Files
Original bundle
Name: TH-1325_07610203.pdf
Size: 9.92 MB
Format: Adobe Portable Document Format
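
The abstract describes the RFD feature only in words: the second, third, and fourth formant peaks are expressed relative to the first formant peak. A minimal sketch of that idea follows. It assumes the displacement is a simple difference in peak magnitude (in dB), which the abstract does not confirm; the thesis may define it differently. The function name and the input values are hypothetical and chosen only for illustration.

```python
def relative_formant_displacement(peaks_db):
    """Return each higher formant peak minus the first formant peak.

    peaks_db: formant peak magnitudes [F1, F2, F3, F4] in dB.
    ASSUMPTION: "relative displacement" is taken here to be a plain
    difference in dB; the record's abstract does not give the formula.
    """
    if len(peaks_db) < 2:
        raise ValueError("need the first formant peak and at least one more")
    first = peaks_db[0]
    return [peak - first for peak in peaks_db[1:]]


# Made-up peak magnitudes, not data from the thesis.
example_peaks_db = [60.0, 52.0, 45.0, 40.0]
rfd = relative_formant_displacement(example_peaks_db)
print(rfd)
```

Under this assumption, a flatter spectral tilt (more energy at higher frequencies) would show up as RFD values closer to zero, which is the kind of variation a stress classifier could use as a feature.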