Speech knowledge based broadcast audio classification and phone recognition

Khonglah, Banriskhem K

Speech knowledge based broadcast audio classification and phone recognition

Files

Abstract-TH-1627_126102033.pdf (59.73 KB)

TH-1627_126102033.pdf (11.21 MB)

Date

2017

Authors

Khonglah, Banriskhem K

Abstract

In this work an alternate approach for automatic transcription of the anchor speakers' speech in broadcast audio for Indian News Channels is proposed. Some preprocessing methods are required in order to take care of the presence of scenarios like speech with background music and pure music which correspond to the news headlines and voiceover, in addition to the clean speech. The first preprocessing task is the speech versus music classification module which involves the use of speech specific features for classification. The speech output may have some of the segments containing background music, due to the speech specific nature of the features. These segments are then passed through clean speech versus speech with background music classification module which involves the use of the features based on the average and relative characteristics of the vocal tract system. The speech with background music is enhanced using the temporal, spectral processing and perceptual methods where the source information is mostly exploited to obtain the enhanced speech. Finally the clean and enhanced speech segments are passed through the phone recognition system and the final output will be the transcription of clean and enhanced speech, with improved accuracy compared to directly passing the broadcast audio through the phone recognition system.

Description

Supervisor: Prasanna, S R Mahadeva

Keywords

ELECTRONICS AND ELECTRICAL ENGINEERING

URI

https://gyan.iitg.ac.in/handle/123456789/875

Collections

PhD Theses (Electronics and Electrical Engineering)

Full item page

Gyan-IR

Speech knowledge based broadcast audio classification and phone recognition

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By