Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving

Mazhar, Saquib

dc.contributor.author	Mazhar, Saquib
dc.date.accessioned	2025-12-16T11:24:54Z
dc.date.issued	2025
dc.description	Bhuyan, M K
dc.description	Ahamed, Shaik Rafi
dc.description.abstract	Semantic segmentation, which assigns a class label to each pixel in an image, is fundamental to autonomous driving systems that must perceive and understand complex environments in real time. Achieving high segmentation accuracy while maintaining computational efficiency remains a major challenge, particularly for embedded systems with limited resources. This thesis addresses these challenges by proposing novel methods that enhance both segmentation performance and efficiency. A key contribution is a composite loss function that integrates cross-entropy, boundary, and region-based losses to improve accuracy around object edges and better handle small and infrequent classes. This leads to improved segmentation results, especially for critical but underrepresented objects such as pedestrians and traffic lights. To support real-time operation, this work introduces lightweight network architectures. The Inverse Residual Dilation Pyramid Network (IRDPNet) employs efficient bottlenecks and multi-scale dilation to significantly reduce model size while preserving accuracy. The Block Attention Network (BANet) further enhances contextual understanding by integrating a modified attention mechanism that captures long-range dependencies with minimal overhead. Additionally, the Context-Guided Multi-scale Attention Network (CGMANet) is proposed to combine low-level spatial features and high-level semantic cues through a hybrid attention mechanism. This architecture effectively balances detail preservation and context awareness, making it highly suitable for deployment on embedded platforms. Collectively, these contributions offer practical solutions for real-time, accurate semantic segmentation in autonomous driving scenarios.
dc.identifier.other	ROLL NO.186102113
dc.identifier.uri	https://gyan.iitg.ac.in/handle/123456789/3073
dc.language.iso	en
dc.relation.ispartofseries	TH-3672
dc.rights	https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.uri	https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.title	Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving
dc.type	Thesis