Speech Emotion Recognition with Application to Mental Health: A Tensor Perspective

Pandey, Sandeep Kumar

Speech Emotion Recognition with Application to Mental Health: A Tensor Perspective

Files

Abstract-TH-2937_156302006.pdf (77.93 KB)

TH-2937_156302006.pdf (8.15 MB)

Date

2022

Authors

Pandey, Sandeep Kumar

Abstract

Speech Emotion Recognition (SER) has been an active area of research ever since the need for smooth and natural Human-Computer Interaction (HCI) came into play. This thesis aims to develop an SER system based on an amalgamation of Tensor Factorization and Neural Network-based learning to mitigate several issues while using contemporary deep learning architectures. This, in turn, is helpful towards recognizing the mental health issues such as depression, anxiety, etc., from speech signals as it is shown in the literature that mental health and emotions are highly correlated. As such, this thesis tries to provide techniques to incorporate emotional information to assess mental health conditions from speech signals, thereby helping the psychologists assign a depression score to patients based on their experience and machine-generated score, thereby mitigating any human bias which might creep in human-only situations.

Description

Supervisors: Shekhawat, Hanumant Singh and Prasanna, S R Mahadeva

Keywords

Speech Emotion Recognition, Deep Learning, Tensor Factorization, Mental Health, Depression Diagnosis, Multi-cultural, Fusion, Multi- modal, Multi-task

URI

https://gyan.iitg.ac.in/handle/123456789/2269

Collections

PhD Theses (Electronics and Electrical Engineering)

Full item page

Gyan-IR

Speech Emotion Recognition with Application to Mental Health: A Tensor Perspective

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By