Lightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving

dc.contributor.authorMazhar, Saquib
dc.date.accessioned2025-12-16T11:24:54Z
dc.date.issued2025
dc.descriptionSupervisors: Bhuyan, M K and Ahamed, Shaik Rafi
dc.description.abstractSemantic segmentation, which assigns a class label to each pixel in an image, is fundamental to autonomous driving systems that must perceive and understand complex environments in real time. Achieving high segmentation accuracy while maintaining computational efficiency remains a major challenge, particularly for embedded systems with limited resources. This thesis addresses these challenges by proposing novel methods that enhance both segmentation performance and efficiency. A key contribution is a composite loss function that integrates cross-entropy, boundary, and region-based losses to improve accuracy around object edges and better handle small and infrequent classes. This leads to improved segmentation results, especially for critical but underrepresented objects such as pedestrians and traffic lights. To support real-time operation, this work introduces lightweight network architectures. The Inverse esidual Dilation Pyramid Network (IRDPNet) employs efficient bottlenecks and multi-scale dilation to significantly reduce model size while preserving accuracy. The Block Attention Network (BANet) further enhances contextual understanding by integrating a modified attention mechanism that captures long-range dependencies with minimal overhead. Additionally, the Context-Guided Multi-scale Attention Network (CGMANet) is proposed to combine both low-level spatial features and high-level semantic cues through a hybrid attention mechanism. This architecture effectively balances detail preservation and context awareness, making it highly suitable for deployment on embedded platforms. Collectively, these contributions offer practical solutions for real-time, accurate semantic segmentation in autonomous driving scenarios.
dc.identifier.otherROLL NO.186102113
dc.identifier.urihttps://gyan.iitg.ac.in/handle/123456789/3073
dc.language.isoen
dc.relation.ispartofseriesTH-3672
dc.rightshttps://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/
dc.titleLightweight Deep Learning Architectures for Semantic Scene Segmentation for Applications in Autonomous Driving
dc.typeThesis

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Abstract-TH-3672_186102113.pdf
Size:
104.38 KB
Format:
Adobe Portable Document Format
Description:
ABSTRACT
Loading...
Thumbnail Image
Name:
TH-3672_186102113.pdf
Size:
4.92 MB
Format:
Adobe Portable Document Format
Description:
THESIS

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
227 B
Format:
Item-specific license agreed to upon submission
Description: