Foreground speech segmentation and enhancement

dc.contributor.author: Deepak, K. T.
dc.date.accessioned: 2016-12-19T06:00:48Z
dc.date.accessioned: 2023-10-20T07:28:23Z
dc.date.available: 2016-12-19T06:00:48Z
dc.date.available: 2023-10-20T07:28:23Z
dc.date.issued: 2016
dc.description: Supervisor: S. R. M. Prasanna (en_US)
dc.description.abstract: Speech enhancement is an active area of research and a challenging task when the signal is recorded in natural environments. In a typical single-microphone recording scenario, it is safe to assume that the desired speaker is closer to the microphone than the other interfering acoustic sources. In this work, the speech signal from the close-speaking person is regarded as foreground speech and the remaining interfering sources as background noise. Because the desired speaker is closer to the microphone than the background sources, the two differ in their signal characteristics. When speech is recorded in natural environments, its production characteristics also tend to vary with the levels of the interfering sources. The objective of this thesis is to exploit such unique characteristics of speech production to temporally segment foreground speech from the rest of the background and further enhance it. The high signal-to-noise ratio (SNR) regions of foreground speech are robust to interfering noise. The high-SNR region around glottal closure instants (GCIs) in the time domain and vocal tract information in the spectral domain are used to derive features for segmenting and enhancing foreground speech. (en_US)
dc.identifier.other: ROLL NO. 10610204
dc.identifier.uri: https://gyan.iitg.ac.in/handle/123456789/777
dc.language.iso: en (en_US)
dc.relation.ispartofseries: TH-1527;
dc.subject: ELECTRONICS AND ELECTRICAL ENGINEERING (en_US)
dc.title: Foreground speech segmentation and enhancement (en_US)
dc.type: Thesis (en_US)
Files
Original bundle
Name: Abstract-TH -1527_10610204.pdf
Size: 96.6 KB
Format: Adobe Portable Document Format
Description: Abstract
Name: TH -1527_10610204.pdf
Size: 16.52 MB
Format: Adobe Portable Document Format
Description: Thesis