Speech knowledge based broadcast audio classification and phone recognition

dc.contributor.authorKhonglah, Banriskhem K
dc.date.accessioned2017-11-28T09:53:34Z
dc.date.accessioned2023-10-20T07:28:51Z
dc.date.available2017-11-28T09:53:34Z
dc.date.available2023-10-20T07:28:51Z
dc.date.issued2017
dc.descriptionSupervisor: S. R. M. Prasannaen_US
dc.description.abstractIn this work an alternate approach for automatic transcription of the anchor speakers' speech in broadcast audio for Indian News Channels is proposed. Some preprocessing methods are required in order to take care of the presence of scenarios like speech with background music and pure music which correspond to the news headlines and voiceover, in addition to the clean speech. The first preprocessing task is the speech versus music classification module which involves the use of speech specific features for classification. The speech output may have some of the segments containing background music, due to the speech specific nature of the features. These segments are then passed through clean speech versus speech with background music classification module which involves the use of the features based on the average and relative characteristics of the vocal tract system. The speech with background music is enhanced using the temporal, spectral processing and perceptual methods where the source information is mostly exploited to obtain the enhanced speech. Finally the clean and enhanced speech segments are passed through the phone recognition system and the final output will be the transcription of clean and enhanced speech, with improved accuracy compared to directly passing the broadcast audio through the phone recognition system.en_US
dc.identifier.otherROLL No. 126102033
dc.identifier.urihttps://gyan.iitg.ac.in/handle/123456789/875
dc.language.isoenen_US
dc.relation.ispartofseriesTH-1627;
dc.subjectELECTRONICS AND ELECTRICAL ENGINEERINGen_US
dc.titleSpeech knowledge based broadcast audio classification and phone recognitionen_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
Abstract-TH-1627_126102033.pdf
Size:
59.73 KB
Format:
Adobe Portable Document Format
Description:
Abstract
No Thumbnail Available
Name:
TH-1627_126102033.pdf
Size:
11.21 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: