Shouted, Overlapped and Competitive Speech Detection in Indian Television News Debates

No Thumbnail Available
Date
2022
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Television (TV) news debates present expert opinions, analysis and discussions on contemporary events. These debates play a critical role in navigating public belief and understanding of socio-politically relevant topics. This encourages several agencies to analyze the TV news debate content for monitoring their influence. The availability of huge (and ever increasing) amount of news debate data calls for the necessity of automatic content analysis. TV news debates are generally argumentative in nature. Such arguments are often associated with the presence of shouted, overlapped, and competitive speech. In this context, the present thesis aims to detect these three speech categories in Indian TV news debates. The first contribution of this thesis is the development of an Indian Broadcast News Debate (IBND) corpus containing audio signals from 15 news debates (approximately 13 hours). A multi-level annotation procedure was followed to obtain the final annotations for the three targeted tasks of the thesis. The second contribution lies in the proposal of excitation source based Shouted Speech Detection (SSD). Both handcrafted and learned features from excitation source-based representations are explored for SSD. An autoencoder with Bi-GRU based architecture is used as classifier. The third aim of the thesis is to identify the overlapped speech in TV news debates. Phase-based representations of the speech signals are established as efficient features for Overlapped Speech Detection (OSD) using a CNN-LSTM based classifier. Finally, the shouted and overlapped speech classification network embeddings and their prediction scores are used as features to identify the competitive speech. It has been shown that the detection of competitive speech can be performed efficiently using high-level information of both shouted and overlapped speech.
Description
Supervisors: Guha, Prithwijit and Prasanna, S R Mahadeva
Keywords
Indian Broadcast News Debates, Shouted Speech, Overlapped Speech, Competitive Speech, Indian Broadcast News Debate Corpus, Excitation Source Features, Phase Features, Deep Learning
Citation